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PREFACE 

A comprehensive multispectral program devoted to the advancement of state- 
of-the-art techniques for remote sensing of the environment has been a continu- 
ing program at the Environmental Research Institute of Michigan (ERIM), for- 
merly the Willow Run Laboratories of The University of Michigan. The basic 
objective of this multidisciplinary program is to develop remote sensing as a prac- 
tical tool to provide the user with processed information quickly and economically. 

The importance of providing timely information obtained by remote sensing to 
such people as the farmer, the city planner, the conservationist, and others con- 
cerned with problems such as crop yield and disease, urban land studies and de- 
velopment, water pollution, and forest management must be carefully considered in 
the overall program. The scope of our program includes: (1) extending the under- 
standing of basic processes; (2) discovering new applications; (3) developing advanced 
remote-sensing systems; (4) improving fast automatic data processing systems to 
extract information in a useful form; and also (5) assisting in data collection, pro- 
cessing, analysis and ground truth verification. The MIDAS program applies di- 
rectly to No. (4) with its improved data processing capability. 

This document is the final report for Phase I of the MIDAS program under NASA 
Contract NAS1-11979 and covers the period from October 1972 through February 1974. 
The contract effort was monitored by Mr. William Howie of NASA-Langley. The over- 
all program is guided by Mr. R. R. Legault, a Vice President of ERIM and Director 
of the Infrared and Optics Division. Work on this contract was directed by J. D. 
Erickson, Head of the Multispectral Analysis Section and by F. J. Kriegler, Princi- 
pal Investigator. The ERIM number for this report is 195800-25-F. 

ERIM personnel who contributed to this project and who co-authored this report 
are Dempster Christenson, Michael Gordon, Roland Kistler, Seymour Lampert, 

Robert Marshall, and Rowland McLaughlin. In addition to providing the text, their in- 
dividual contributions were as follows: Dempster Christenson and Michael Gordon 
provided system programming and diagnostic software; Roland Kistler and Seymour 
Lampert provided the detailed design and performed system checkout; Robert 
Marshall aided in overall system configuration; Rowland McLaughlin organized this 
report. The authors wish to acknowledge the direction provided by Mr. R. R. Legault 
and Dr. J. D. Erickson. Outstanding contributions were made by the following persons: 
John Baumler, Clyde Connell, William Juodawlkis, Robert Pierson, Cary Wilson, 
and Nancy Wilson for their efforts in system construction. 
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MIDAS, PROTOTYPE MULTIVARIATE INTERACTIVE DIGITAL ANALYSIS 

SYSTEM — PHASE 1 

Volume I: System Description 
1 

INTRODUCTION 

1.1 SUMMARY 

MIDAS, which stands for Multivariate interactive Digital Analysis System, represents a 
breakthrough in the field of multispectral scanner image analysis by providing a low-cost capa- 
bility for user-oriented, interactive, near -real -time, digital analysis to produce thematic map- 
ping with instantaneous or multi-temporal data. MIDAS accepts data from multispectral scan- 
ners in the form of high-density digital tape, computer -compatible tapj, or analog tape, and 
makes use of proven multispectral processing techniques (including signature extension) within 
an innovative hardware approach resulting in a cost-effective, user -controlled system for 
multispectral analysis and recognition. Its hardware and software are intended to require a 
minimum amount of instructional training for successful operation. MIDAS is intended to pro- 
vide multispectral analysis for applications in disciplines such as agriculture, urban planning, 
forestry, geology, pollution detection, hydrology, and others. Features may be extracted that 
are spectral, spatial, temporal, and (possibly) polarization-dependent, thus giving a very gen- 
eral and powerful capability. 

This report describes the status achieved at the end of the first phase of a two-phase, 2-1/2 
year program to demonstrate the unique advantages of a special-purpose multispectral pro- 
cessor. In this machine the parallel digital implementation capabilities of a low-cost processor 
are combined with a mini-computer to achieve near -real-time operation of a complete process- 
ing system that includes multiple, user -selectable, preprocessing functions and color displays. 
The following sections describe the organization of the processor, its design details, its soft- 
ware design and implementation, and the further steps necessary in Phase II to make the system 
more complete. In addition, Phase II will demonstrate the effectiveness of this innovative ap- 
proach in contrast with that of using large general-purpose comp .ter s presently employed in 
the Earth Observations Program. 

Two additional volumes are considered to be a part of this final report, Volume II describing 
the diagnostics used with the system to maintain error free operation and Volume III giving hard- 
ware design details. 

1.2 A RATIONALE FOR MIDAS 

A particularly important and easily overlooked aspect of applying a remote sensing, multi * 
spectral system to aid in mapping crops, detecting pollution, or locating some ecological dis- 
turbance is that of processing the data to provide the proper information to someone in a time 
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short enough to meet his needs. Unfortunately, ongoing programs do overlook this aspect of 
the system design problem, for reasons which are rarely clear. 

Knowing the utility of these techniques, one would think that the data, gathered as it is at 
very high rates and over very large areas, must be processed before it can be useful. And when 
the time allowed to produce such results is relatively short, as it invariably is, then it becomes 
clear that a major problem exists and requires a solution. 

The magnitude of the discrepancy between the ability of a sensor to gather this data and 
the ability of a general purpose computer to process it becomes the next aspect of the problem 
to be assessed. This enormity can probably be best appreciated by considering a brief numer- 
ical example. An airborne scanner will, typically, gather data over a 20- to 30-mile flight line 
in about 15 minutes on one reel of magnetic tape. A general-purpose digital computer can 
classify this data in a time about 1000 times ?s long. Thus, the data collected in 15 minutes 
will require 15,000 minutes of processing. This would amount to 6 weeks of processing, as- 
suming a 40-hour week. Given one such computer to process this data, this aircraft could only 
be used for eight 15-minute sorties per year. Clearly, the discrepancy in capabilities is un- 
acceptable. 

At this point, one must examine the problem more carefully. We should point out that the 
above discrepancy of 1000 to 1 is actually rather conservative, possibly by as much as a factor 
of 10 but editing of the data should alleviate the problem to this extent. We should also point 
out that this form of processing yields a map containing the color-coded identification of each 
element of the scene at the time of the overflight, but also containing errors in this color- 
coding and geometrical errors resulting from the sensor and motions of the sensor platform. 
And it is important to note that the scene is not at all unchanging as one flies along: ground 
conditions vary, the atmosphere varies, the sensor stability varies and, in short, there is a 
spatial-temporal uncertainty about the data. Processing, as it becomes better understood, 
will require even more computation to remove these errors and to improve the results finally 
provided. A current rationale for this is given by Erickson [ 1]. 

From another point of view, the objective of providing greater speed also includes much 
lower cost. This is true not only for the operational situation but also for research and de- 
velopment. Given the limited resources for research, more investigations can be conducted. 

It has been estimated that processing costs in an operational system can be reduced by about 
a factor of 20 or more from some processing costs based on present day feasibility. For ex- 
ample, the typical data set described earlier could be processed at a cost of less than S400 
instead of present-day typical costs of about $8000. 

Another important objective is to facilitate control by the user. Classification of remotely 
sensed data is an interactive process in which the man and machine must, in fact, be considered 
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as the real processing system. This is not apparent in some processing 4 \ ns since the 
machine is so slow that an operator is easily able to keep pace with t' t ssk, i w system 
in which the processor is substantially faster, the time required by a operator it i.-ree or 
four times longer than that taken for processing. Increases in processor speed will then pro- 
vide little improvement in throughput. It thus becomes evident that well designed, interactive 
display and control subsystems will, in reality, offer the greatest gains in throughput. As 
should also then be evident, the development process must inherently involve an evolutionary 
refinement of the display and control interface. This, to some extent, is an experimental task 
initially addressed in the second phase of this development program. It is a task which can be 
bypassed only at the risk of seriously impaired throughput. This task also implies that the sys- 
tem should be considered as a prototype from which further refined systems may be derived 
for specific programs. To allow this user tailoring of the operational system he requires, 
modular design becomes an important aspect. 

One final motivation exists in addressing this problem: Proper configuration of an oper- 
ating system almost always requires experience in the use of a system similar to the one re- 
quired. 

One must, therefore, see this problem as one of great and, indeed, critical importance to 
the objective of making remote sensing a useful, cost-effective tool. It is thus necessary to 
pursue a course which, in the development of these methods, provides assurance that process- 
ing methods are available to meet the demands of those who would use them. 

The present system was conceived and constructed as a demonstration that a better process- 
ing system will materially assist in practical, economic realization of the benefits of remote 
sensing. 

1.3 CAPABILITY AT THE END OF PHASE I 

Phase I has resulted in the generation of the classifier hardware, an elementary software 

system, and the integration of these with the general purpose digital computer. This system, 

although far from the system envisaged for Phase 11, offers the capability of performing classi- 

-6 

fications at the rate of 5 • 10” seconds per pixel vector for a 4 -channel, 16-category (or 8- 
channel, 8-category) problem. This capability has been demonstrated using CCT input of ERTS 
data from Yellowstone National Park. 

1.3.1 HARDWARE 

The hardware system (Fig. 1) includes the classifier, the controller, and the* diagnostic 
subsystem. The classifier is that portion of the system which accepts the input data -vector and 
performs the pipeline computations ending in *he classification of the vector into one of 16 classes. 
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The controller subsystem selects the input channels to go to the classifier or to the com- 
puter, routes control and data vectors to the internal storage of the classifier from the com- 
puter, and routes data from the classifier to the compute! or displays. This also includes 
computer interfacing and its control. 

The diagnostic subsystem operates in conjunction with the classifier as a multi-port bus 
able to access several key mo uies in the pipeline to allow this data to be provided to test 
equipment or to the computer* Using this subsystem, debugging and troubleshooting can be 
done under software control, allowing fault isolation to one or two cards. 

1.3.2 SOFTWARE 

Software capability at this time includes a set of programs operating under the DEC (Dig- 
ital Equipment Corporation) DOS (Disk Operating System) monitor; among these are wire-wrap 
and diagnostic programs, input and output control programs, and analysis programs. 

The wire-wrap programs have been written to allow card input control of a Gardner-Denver 
wire-wrap machine and are currently in t.ie form of the original machine wiring, as vet un- 
changed to conform to final wiring after debugging. 

Diagnostic programs include card test programs and programs both to exercise the arith- 
metic circuitry of MIDAS and to isolate faults in MIDAS. 

The card test programs are employed to isolate component faults in the event of failure 
or to test a card after wiring and before insertion to verify its correct operation. Test jigs 
are used to interface these cards to the computer. 

The diagnostic programs include subsystem or subassembly diagnostics and an accuracy 
analysis package. These programs allow overall or subassembly test and fault isolation. 

The analysis program* allow data to be input to the computer and analyzed into multivariate 
parameters for the determination of classifier setup. These also properly scale the parameters 
to suit the arithmetic circuitry of the classifier. 

Finally, the required parameters must be output to the classifier in properly sequenced 
tables along with addresses of memory locations in the classifier. 

1.3.3 SYSTEM INTEGRATION 

Interfacing with the DEC PL^-ll 45 computer in the MIDAS System has been accomplished 
via unibus interface cards standard with DEC. The system as such has been tested using simu- 
lated data and actual ERTS data. Simulation tests verify the overall correct operation ol the 
system as an arithmetic device controllable by its software. Te *s using ERTS data have veri- 
fied that the MIDAS will accept ERTS data and properly classify it. Only a scaling problem to 
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optimize the dynamic range of the fixed-point arithmetic remains to be corrected. This cor- 
rection will serve to improve the performance for some unusual multivariate distributions. 

The MIDAS is, of course, not a full system at this point and could not be used in processing 
except in special cases: it presently requires the use of other (non-contract) equipment for dis- 
play, printing, location and editing of training sets. These deficiencies are to be eliminated in 
Phase II. 
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CONCLUSIONS AND RECOMMENDATIONS 
2.1 CONCLUSIONS FROM PHASE I 

(1) The principal objective of Phase I was to demonstrate that an all-digital classifier was 
a feasible and economical device capable of being controlled by a mini-computer and used to 
process remotely sensed data from multispectral sources. Phase II is intended to make the 
system more complete and provide the means both to process such data in a short time employ- 
ing close matching of the system and the human operator and to verify that the design is effec- 
tive for this purpose. 

(2) The basic hardware and software elements of MIDAS have been designed, assembled, 

tested and operated. The system performs as specified, accepting data in a burst mode at a 
-6 

rate of 5 ' 10 ° seconds per pixel vector and providing classified output at that rate. Some- 
what higher performance may be obtained, if necessary, but design limits have not yet been ex- 

3 

plored. As stated above, the present rate is 200 x 10 pixels/second and could probably be 

3 

raised to about 300 x 10 pixels/ second. Higher rates would require more parallelism or 

greater length in the pipelines. These approaches could allow operation, using the same com* 

6 

ponents in a redesigned architecture, out to about 10 pixels/ second. Above this level it appears 
that the parallelism, the wiring techniques, and the MSI technology would begin to create limit- 
ing problems requiring a different technology and greater cost in terms of the product of di- 
mensions and classification rate. Thus the approximate limit using the above technologies 

c 6 6 

seems to be about 16 10 channels/second, or a bit rate of 128 x 10 (8 / 16 » 10 ) bits/second 

6 6 

entering the preprocessor, or 64 x 10 (8 x 8 x 10 ) dimension, s/second. As a result, this tech- 
nology seems well suited to process data from more advanced sensors, now anticipated, such 
as EOS, etc, operating in the 100 x 10 bit/second range. The processing rate of MIDAS will be 
25.6 - 10^ (16 x 8 \ 2 x 10 5 ) bits/second in Phase II. 

It is of interest to compare this technology with others by way of an example as a means 
of assessing the performance of representative systems and the importance of some aspects 
of these systems. The times required to process an ERTS frame into 12 categories is shown 
in Table 1. The times shown are for the five major steps in the processing of data. The first 
step is the production of a graymap from the data. This can be done quickly if the data source 
is fast such as a high density tape (HDT) and a special printer such as an ink jet printer is used 
rather than a conventional tine printer. The second step, that of field location can be the most 
time consuming, and this is where the largest gain in throughput can be made. By making 
proper use of an HDT, CRT display, and ink jet display, the time required may be reduced from 
the present forty hours to four hours. The HDT can quickly load a disk with an ERTS frame 
and can display it at low resolution on a color CRT. Zoom capability and fast interactive color en- 
hancement can be used as presently set up for producing an enhanced color hard copy with an inkjet 
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TABLE 1. COMPARISON OF PROCESSING 
TIMES FOR VARIOUS SYSTEMS 

PROBLEM; CLASSIFY 12 CATEGORIES OVER AN ERTS FRAME (4 TAPES CCT) 



(Times in Hours) 

Initial Field 

Map Location 

Signature 

Calculation 

Classification 

Map 

Total 

(Less 

Location! 

7094 

2.0 

40.0 

2.0 

8.0 

2.5 

14.5 

MIDAS (CCT) 

2.0 

40.0 

2.0 

0.3 

2.5 

6.8 

MIDAS (HDT) 

0.1 

40.0-4.0 

0.3 

40 sec 

0.1 

-0.5 

360/57 

1.0 

40.0 

1.0 

4.0 

1.25 

7.25 


(Est) 
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printer. This color hard copy should greatly aid the user in field location, thus reducing lo- 
cation time to four hours. The remaining three processing steps can be seen as being accom- 
plished very quickly provided that the data source (HDT) is available. 

(3) Diagnostic methods, for both software and hardware, assume significant importance in 
successful design and operation of such a system. This facet of system design was not assigned 
great importance at the beginning of the program but became progressively more important as 

design and assembly proceeded. As a result, as assembly progressed from card to subassembly 4' 

to full assembly, software and hardware design changes were found desirable to facilitate auto- 
matic or semi-automatic testing of these levels of assembly. The final system, then, should 
be readily maintainable. This same approach should be continued in Phase II, to facilitate de- 
bugging and maintenance, and also to make available to the operator intermediate computational 
results at high speed if he desires. 

(4) Operation of MIDAS during its debugging and test phases has made it evident that a dis- 
play in color is an essential item in the system. In order to verify and estimate the perfor- 
mance of the system in processing ERTSdata,a simple C- scan black and white display was as- 
sembled from various laboratory components. This was able to effectively display output from 
one class at a time; but, for the multimodal (or multiclass) signatures, the display was clearly 
a marginal device. The moving-window color display planned for Phase II will be much more 
effective, especially when integrated into the software-controlled system. 

In summary, the objectives of the first phase of the program have been met. This includes 
demonstrating the feasibility of the digital classifier using readily available and economical 
components within the state of the art. The capability of employing a medium scale minicom- 
puter system (DEC PDP-11/45) to control and set up such a classifier has been established. 

The applicability of current technology, including an advancement such as the present MIDAS, 
to projected space-borne sensors such as EOS seems clear. The problem of maintaining a 
special-purpose system employing a combination of hardware diagnostic buses and software 
has been adequately solved. 

2.2 RECOMMENDATIONS 

(1) By reason of the established feasibility of the design approach for the classifier, the 
design of the preprocessor should employ the same technology, continuing the present design 
approach of fault diagnostic access. This is elaborated upon in Section 5. 

(2) Color display of on-line results in the form of a moving window seems necessary. 

There appears to be no other way of allowing an operator to monitor the results effectively 
during analysis and test of the processing steps. 
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(3) Also, for any high resolution data sources — especially when natural scenes or agri- 
cultural scenes containing small fields are of interest — there is a need for a high-resolution, 
color, hard copy to allow an operator to go off-line for training set selection and analysis to 
afford efficient use of the system. Analysis of current and ongoing programs to process ERTS 
data indicates that times as long as a week are sometimes required for the analyst to perform 
this function. In the best cases, this operation may require as little as one or two hours. This 
time, if spent on-line, would seriously reduce the system throughput since the operations of 
analysis and classification for typical batches of data would require only ten to fifteen minutes 
each. For many problems, however, the operator could remain on-line and obtain turn-around 
of a processing problem in times as short as one hour or as long as four hours. Determination 
of the basis for choice between an on-line and a mixed, on- and off-line mode will have to be 
made experimentally. In any event, final output of imagery will require such a hard-copy output. 

(4) Operation of the system as presently configured has indicated that system control using 
the DEC DOS system is an adequate minimum. Better access times should be sought, however, 
as a goal to obtain better interaction. This may be accomplished by switching to the DEC key- 
CRO (VT05) terminal as the console device, by employing a line printer for output and, ulti- 
mately, by adding the RAMTEK display system. 

(5) To provide wider access to the MIDAS, some consideration should be given to remote 
access to the system. This may be considered in the context of MIDAS operating in a time- 
share mode using data stored locally and providing outputs remotely or locally as a function of 
the quantity of data and the transmission costs and rates available. Time-share operating sys- 
tems under development by DEC may be applicable. 

(6) Present studies on the inclusion of spatial context and signature mixtures should be 
followed and incorporated, if possible, during Phase II design. 

(7) Documentation of the system design in APL (probably APL/360) should be seriously 
considered as the means of defining, modeling, and testing the system. This could, conceivably, 
be extended to provide wire-wrap information directly. However, the latter may be beyond the 
scope of Phase II. 

2.3 RESULTS 

The MIDAS Phase I system has been constructed and tested in two ways. First, each 
arithmetic circuit has been operated and tested from input to output to ascertain that an output 
is correctly computed from an input over the range required. Secondly, an ERTS data set from 
Yellowstone National Park has been processed to provide recognition maps of 16 categories. 
These outputs were then compared by inspection with similar maps obtained from the IBM 7094 
system. 
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The first tests have been successfully completed and verify the arithmetic design. 
MIDAS processing of ERTS data is verified by comparison with the results obtained as 
scribed in Appendix A, Performance Test of MIDAS. These results are shown in Fig. 
colors and categories used are:* 


The 

de- 

2. The 


Green - Lodgepole Pine 1 & 2, Spruce Fir 

Brown - Grasslands 1, 2, 3; Shrubs; Grass-Brush; Brush 

Orange - Light Rock 1, 2, 3 

Magenta - Dark Rock 

Black - Thermal Deposits 

Blue - Water 


These may be compared with the results obtained from the 7094 shown in Fig. 3 using the colors 
listed in Table 2.* 

The classification results have been examined and show good agreement among the cate- 
gories with the exception of "unrecognized." This category does not exist in the MIDAS map 
because the threshold was different than for the 7094, forcing recognition of all data vectors, 
and the oversight was not detected until after the classification run and printout. 

Furthermore, the 7094 color map was printed on the Mead Technology (formerly Data 
Corp.) color printer in Dayton, Ohio, and hence has a color range which we cannot duplicate 
locally. In consequence, interpretation of the results by inspection is difficult. The two re- 
sults do, however, appear to be substantially the same. Though an element-by-element com- 
parison with the 7094 results done on the 7094 computer or the PDP-11/45 would have been 
desirable, this would have required more time for software testing than was available. 

To sum up, the MIDAS hardware and software system has been completed and tested. 
Comparison with the 7094 recognition results using the same signatures indicates that MIDAS 
recognition results are substantially the same. 


♦Not all copies of this volume include color. 
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(b) Water 


FIGURE 2. MIDAS RECOGNITION MAP 
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TABLE 2, 

Map Color 

Violet 

Blue 

Cyan 

Green 

Yellow -green 
Yellow 

Yellow-orange 

Orange 

Orange -red 

Magenta 

White 

Black 


YELLOWSTONE CLASSIFICATION MAP 
COLOR LEGEND* 

Terrain Class 

Amorphous and shadowed rock 
Water 

Cc . ous forest >45% crown closure 
(L*. v pole, Ponderosa Pine, Douglas fir, 
and Spruce-fir) 

30-45% Conifer cover over grass 
15-30% Conifer cover over grass 

Grasslands 
Grass/ brush 

30-45% Conifer cover over felsic rock- 
rubble 

15-30% Conifer cover over felsic rock- 

rubble 

Felsic rock 

Thermal deposits/ snow 
Unclassified 


♦Not sil copies of this volume include color. 
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MIDAS USE AND SOFTWARE 
3.1 GENERAL CONSIDERATIONS 

In this section, we describe the use of the MIDAS system through the programs employed 
on the mini-computer to provide the user with information about and control of the processing. 
Our approach has been to keep the programs as simple, as easy to use, and as easy to change 
as possible in this first phase of development. For example, the system issues a parameter- 
input request to the user, and the responses possible to obtain the desired result at each step 
are explained. 

3.1.1 DOS MONITOR 

The operating system we use on the PDP-11/45 computer is the Disk Operating System 
(DOS). This is the standard Digital Equipment Corp. (DEC) operating system provided to all 
PDP-11/45 users who purchase a disk peripheral. 

The DOS Monitor supports the PDP-11/45 user through development and execution of pro- 
grams by providing the following: 

(1) convenient access to system programs and utilities such as the FORTRAN-IV Com- 
piler, the MACRO-11 assembler, a linking program (LINK-11), an on-line debugging 
package (ODT-11), an editor (EDIT-11), a file-utility package (PIP-11), and other 
system utility programs 

(2) input/output transfers at four levels, ranging from direct access of device drivers to 
full formatting capabilities, while providing the convenience of complete device inde- 
pendence when possible 

(3) a file system for management of secondary storage 

(4) a versatile set of keyboard commands for use in controlling the flow of programs 

The user communicates with the DOS Monitor in two ways: through keyboard instructions 
(COMMANDS) and through programmed instructions (REQUESTS). 

Keyboard commands enable the user to load and run programs; assign I/O devices or files; 
start or restart programs at specific addresses; modify the contents of memory locations; re- 
trieve system information such as time of day, date, etc, and dump core. A user can utilize 
programmed requests, which are macros assembled into the user’s program and through 
which he specifies the operation to be performed via the monitor. Some programmed requests 
are used to access I/O transfer facilities, and to specify data location, its destination, and its 
format. In these cases, the monitor brings drivers in from disk, performs the data transfer, 
and informs the user of the status of the transfer. Other requests access Monitor facilities 



\ 


! 



FORMERLY WILLOW RUN LABORATORIES THE UNIVERSITY Of MICHIGAN 


to query system variables, such as time of day, date, and system status information, and to 
specify special functions for devices. 

The DOS Monitor currently in use on the MIDAS PDP-11/45 system is version 8.8. Version 
9.0 will be used in Phase II of the program. 

A detailed description of the DOS Monitor services can be found in the ’’Disk Operating 
System Monitor Programmer’s Handbook” (DEC-ll-OMONA-A-D) [2], 

3.1.2 MIDAS OPERATING SOFTWARE OVERVIEW 

The software provided for Phase I of the MIDAS project is designed to exhibit the accuracy 
and speed of the MIDAS digital classifier hardware. Since development of the MIDAS Classifier 
hardware was of primary importance in Phase I, the software development was limited in 
available resources and concentrated mainly on developing the most effective methods of data 
transfer between the PDP-11/45 and the MIDAS hardware. Because development of the soft- 
ware and hardware took place simultaneously, the time necessary to fully implement many use- 
ful facets of control software was not available. In addition, many of the control functions not 
implemented were omitted as a result of lack of adequate hardware peripheral I/O devices 
(e.g., interactive CRT, large-size fast-access disk, etc.). Since these drawbacks will not be 
present during the implementation of Phase II, a major software thrust will be made in the 
area of human-engineering control. However, the software developed in Phase I does provide 
a basic structure for interactive multispectral classification procedures. 

All programs for the MIDAS software system were written for the PDP-11/45 with the 
following available devices: 

(1) 24K 16 -bit core memory 

(2) FP-11 floating point processor 

(3) LA-305 serial DEC writer 

(4) RK-11C disk controller and RK-05 disk drive 

(5) VT-05B alphanumeric CRT 

(6) TM-11 magnetic tape drive controller, one TU-10 7-track magnetic tape drive, and 
two TU-10 9-track magnetic tape drives 

(7) KW-11P programmable real-time clock 

(8) two DR-1 IB general device interfaces (for transfer of data to and from MIDAS hard- 
ware) 

(9) one DR-1 1C general device interface 

All program source code was entered on-line to the PDP-11/45 via the console keyboard 
and subsequently stored both on disk and magnetic tape. Most of the routines and programs 

were written for and assembled in the system assembler program, MACRO-11, although some 
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statistics routines and several diagnostic main programs were written for and compiled by the 
FORTRAN IV compiler. Routines written in FORTRAN were not empl< yed in any real-time data 
acquisition processes because of the inefficient code generated by the FORTRAN compiler. 

Current versions of MIDAS operating programs (Fig. 4) are available under user-identifi- 
cation code (UIC) 100,100 in the following files: 


1 . 

2 . 

3. 

4. 

5. 

6 . 
7. 


DISPLAY. LDA 

SIGCVT.LDA 

SIG.LDA 

SCALER. LDA 

RAM LD. LDA 

CLAS1.LDA 

CLAS2D.LDA 


- graphic display program 

- signature conversion program 

- signature extraction program 

- signature coefficient scaler 

- MIDAS RAM loader 

- classification control program 

- control program 


The general flow of operation for the MIDAS operating software is as follows: 

Initially, the classification process must obtain the spectral signatures of various types 
of ground species. In order to isolate areas from which to calculate these statistics, the user 
may use some type of grayscale display for selection of boundaries. The program DISPLAY, 
using 9-track ERTS input CCT, displays a single -channel either on the VT-05B CRT or the line 
printer for perusal by the user. From this output, we extract line and point selection for 
training areas. 


At this point, the program SIG allows the user to calculate the statistics of a signature for 
a given area, obtain via a hard-copy output device (either the console or the line printer) the 
results of the statistical calculations, test the validity of the training set statistics by employ- 
ing these statistics to classify the raw data that generated them, and decide from this displayed 
information whether to use a particular training area or to modify the areas to obtain better 
recognition. This iterative, interactive signature extraction process is critical in terms of 
classification accuracy on an unknown dataset. 

After the signature set has been selected, the program SCALER is run on the output data- 
set froir SIG to convert the calculated statistics to coefficients in a format suitable for loading 
into MIDAS RAMs. The magnitude of signature coefficients is error-checked at this level. 

With the output file from the SCALER program, the user may call upon the program 
RAMLD to select a subset of signatures for actual loading into the MIDAS Classifier RAMs. 

The RAMLD program performs the inter- signature normalization procedures necessary for 
the particular subset of signatures chosen for loading. Also, at this juncture, a maximum ex- 
ponent value is chosen as an upper bound necessary for classification accuracy. 
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Now that the classifier coefficients have been loaded into the MIDAS hardware, the user 
normally calls the program CLAS1. CLAS1, via the console keyboard, interrogates the user 
about the classification run about to be made (e.g., output devices to be used, scene boundaries, 
etc.). This information is written into a temporary disk file for use by the program CLAS2D 
which is loaded into core over the CLAS1 program, and giv a control. CLAS2D uses the in- 
to rm at ion in the temporary file, created by CLAS1, to set up the MIDAS control logic, 1 
then to actually control the transfer of data between the MIDAS Classitier, core memory, and 
various peripheral output devices. 

Ordinarily, the output of a classification run is stored on disk and also sent to the C-sean 
CRT display in a real-time mode. However, some post-analysis of the classification results 
is almost always necessary. Currently, DISPLAY provides the capability for grayscale display 
of recognition results (stored on disk) to enner the VT-05E alphanumeric CRT or to the line 
printer. 

This system obviously does not include some of the fine points in interactive digital analy- 
sis of multispectral data, such as post-analysis, pre-classification linear combination of chan- 
nels, or channel selection, etc.; but the currently implemented programs we discuss below do 
provide a framework for future inclusion of these features. 

3.2 MIDAS OPERATING SOFTWARE DESCRIPTIONS 

3.2.1 DISPLAY 

TITLE: DISPLAY - Graphic Display Program 

CSECT: None 

PURPOSE: This program allows the user to obtain a graphic display 0>oth 
soft- and or hard-copy output) of the various typjs of multi- 
spectral data encountered in recognition processing. (See F'g. 

CALLING SEQUENCE: None 

PARAMETERS: Operating parameters are input via the console keyboard in 
response to the following requests: 

1. INPUT FILE = 

This request asks for the file specification of the input data 
set. The string input must conform to CSI (Command String 
Interpreter) format as specified in Appendix D. In general, 
this format is 

DDI):NNNNNN.EEE 

where DD refers to the physical device name (in this case, 
either MT0 or VT. designating magtape unit 0 or the VT-05 
CRT, respectively). 
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FIGURE 5. FLOWCHART FOR DISPLAY (Concluded) 
21 


i 


i 






2. MAP CHANNEL = 

This request asks for the channel to be displayed by the 
DISPLAY program. This request is made only if the input 
device is magnetic tape (thus, the assumption is that raw 
data is being displayed). 

Proper response is a decimal digit specifying the channel 
to be mapped followed by a carriage return. Any other 
response causes request #2 to be reprinted on the console 
until a proper response is made. 

3. OUTPUT DEVICE(S) = 

This request asks for the output device(s) to be used in 
the display run. 

Proper responses are 'VT/ 'LP ’ or 'BOTH/ Any response 
other than the above causes request #3 to be reprinted on 
the console device until a proper response is made. 

4. LINE AND POINT NOS. = 

This request asks for the line and point numbers of the re- 
gion to be displayed. If the region entered in response to 
this request does not exist, the request is reprinted on the 
console and available line and point numbers must be entered. 
Proper response to this request is a string of six decimal 
numbers, separated by commas, with no embedded blanks 
allowed, followed immediately by a carriage return. The 
six numbers have the following significance: 

QNAS - starting line number of display region 
QNSB - ending line number of display region 
QKS - line increment 

QNA - starting pixel number of display region 
QNB - ending pixel number of display region 
QKP - pixel increment 

5. MAP LEVELS = 

This request asks that the map levels be set for the output 
display. Proper response to this request is a series of not 
more than 20 decimal numbers in the range 0 -255. These 
numbers must be separated by commas, must be monotoni- 
cally increasing, and the input series must be terminated 

with a carriage return. 
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6. MAP SYMBOLS = 

This request asks for the map symbols to be useu in the 
output display. 

Proper response to this request is a set of ch iracters, 
separated by commas. The format of this line must be 
C,C,C,C, .. , C < CR> 
where C is any printing ASCII character. 

7. NEW INPUT FILE ? 

This request determines the program branch to be take 1 
after completion of a display run for a given region. 

Proper responses are ’YES’ or ’NO/ with the first char- 
acter of each sufficient. If neither of the above responses 
(or allowable abbreviations) is made, an affirmative response 
is assumed. 

DESCRIPTION: This program provides the user with the ability to display 
two types of multispectral data in the initial implementatior 
of MIDAS software: (1) raw multispectral data and (2) class- 
ification results. 

Input is either through magnetic tape (assumed to be raw 
multispectral data) or from disk storage (assumed to be 
classification results). 

The actual display is performed on either an alphanumeric 
CRT (the DEC VT-05B) or on the line printer, or both si- 
multaneously. 

The correspondence between input data and display char- 
acter is by way of a translate table set up from the two 
vectors (which) the user input: (1) the level vector and 
(2) the symbol vector. 

The display algorithm assumes an input data value M to 
be displayed (where 0 - M - 255). Also assumes a set of 
N map levels L (such that Li < Li + 1, 0 - Li -255 and 
Ln = 255) and a set of N symbols S. Then the display 
character to be used is Si. such that L^j< M ^ Li. 

STORAGE REQUIREMENTS: 43 bytes 
SUBROUTINES REQUIRED: GETARG 

GETFIL 

VTW 
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3.2.2 SIGCVT 

TITLE: SIGCVT - 7094 to 11/45 SIG. CONVERSIONS 

CSECT: None 

PURPOSE: This program provides a software interface between the IBM 7094 
general-purpose digital computer and the PDP 11/45 - MIDAS 
system for the purpose of using signatures which were calculav. 
and extracted on the IBM 7094 to operate the MIDAS system. 

(See Fig. 6.) 

CALLING SEQLL 'ICE: None 

PARAMETERS: Operating parameters are input via the como 1 ** keyboard in 
response to the following requests: 

1. NAME OF FILE TO BE CREATED = 

This request asks for the name of the signature file to be 
created on disk. This file will be contiguous and conform 
to the standard format of a signature file as outlined in the 
STAT write-up. The format of this name must be ’NNNNNN.EEE* 
where N is a character in the filename and E is a character 
of the extension. 

2. NUMBER OF CHANNELS = 

This request asks for the number of channels from which 
these signatures have been calculated. Proper response is 
a one- or two-digit decimal number followed immediately by 
a carriage return. 

3. NUMBER OF SIGNATURES = 

This request asks for the number of signature sets contained 
in the file on the 7 -track input tape. Proper response is a 
decimal number followed by a carriage return. 

DESCRIPTION: This program takes BCD card images of signatures generated 
on the IBM 7094 computer (on 7 -track 800 BPI magnetic 
tape), converts these to ASCII and then, by using the FORTRAN 
I/O conversion routine DECODE, to binary floating-point 
representation. A file is created on disk and these converted 
signatures are stored in the signature file format (see Ap- 
pendix C). 

ERRORS: The following error message is the only one possible in the SIGCVT 
program: 

'UNIT 0 IS NOT 7-TRACK’ means that the drive which is currently 

identified as unit 0 is not a 7 -track magnetic tape drive. The 
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magnetic tape drive with the 7-track tape on it should be switched 
to unit 0 and placed on-line. All other units should be switched off-line. 
STORAGE REQUIREMENTS: 1306^ bytes 
SUBROUTINES REQUIRED: GETARG 

GETFIL 

CONV 

BCD2AS 

3.2.3 SIG 

TITLE: SIG - Signature Extraction Program 

CSECT: None 

PURPOSE: The program provides the overhead functions to transfer raw data 
from digital inputs to a subroutine STAT which calculates the sig- 
nature statistics. This program then displays the actual statistics 
and the results of classification on the raw data using the calcula- 
ted signature coefficients. (See Fig. 7.) 

CALLING SEQUENCE: None 

PARAMETERS: Operating parameters are input via the console keyboard in 
response to the following requests: 

1. GENERAL REGION LINE AND POINT NOS. = 

This request asks for the line and point numbers associated 
with a region which, in general, contains all the data to be 
used in the signature extraction. This data will be stored 
on fast randon-access disk storage for quick retrieval. 

Proper response to this request is a string of six decimal 
numbers separated by commas, the string being terminated 
by a comma. The string arguments are as follows: 

GNSA - starting line number of general signature extraction 
(GSE) region 

GNSB - ending line number of GSE region 
GKS - line increment 

GNA - starting pixel number of GSE region 
GNB - ending pixel number of GSE region 
GKP - pixel increment 

NOTE: In the current implementation, both GKS and GKP are set equal to 1 to 
avoid complications in the pixel-addressing algorithm. 
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FIGURE 7. FLOWCHART FOP SIG (Continued) 
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Output Descriptive Information 
Concerning the Statistics Cal- 
culated for the Signature, Either 




Write Signature 
Statistics and Other 
Information Into 
Disk File 

STEMP.998 (Linked) 



FIGURE 7. FLOWCHART FOR SIG (Continued) 
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FIGURE 7. FLOWCHART FOR SIG (Concluded) 
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2. SIG, LINE AND POINT NOS. AND ID = 

This request asks for the line and point numbers and ID 
(< 8 ASCII characters) for a specific signature. 

Proper response to this request is a string of six decimal 
numbers separated by commas, followed by a string of ASCII 
characters which are assumed to be .the ID associated with 
the signature. Any characters past the 8th one are ignored. 

The first six numbers describe the signature region in the 
same way as the GSE region line and point numbers do. 

The alternate proper response to this request is the simple 
character string ’DONE.’ This response instructs the pro- 
gram to proceed to request #5. 

3. EDITING ? 

This request asks if editing of the raw input data is to be 
performed. The editing algorithm to be employed is de- 
scribed in the STAT and EDIT subroutines. 

Proper response to the request is either 'YES’ or ’NO’ with 
the first character sufficing. If neither proper response (nor 
equivalent allowable abbreviation) is made, an affirmative 
response is assumed. 

4. RETAIN SIGNATURE ? 

This request asks whether or not the signature just calculated 
is to be stored on disk for later use. It is the user’s respon- 
sibility to make this decision based upon the printed output 
provided by the program. 

Proper response to this request is either ’YES’ or ’NO’ with 
the first character of each sufficing. If neither of the proper 
responses (nor equivalent allowable abbreviation) is made, an 
affirmative response is assumed. 

5. SESSION COMPLETE ? 

This request asks if the signature extraction session is com- 
pleted. If a negative response is made, the program returns 
to request #1. Otherwise, the program continues to request #6. 
Proper response to this request is ’YES’ or ’NO’ with the 
first character of each sufficing. If neither of the proper re- 
sponses (nor their allowable abbreviations) is made, an af- 
firmative response is assumed. 
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6. SIGNATURE OUTPUT FILENAME = 

This request asks for the name of the output file to be created 
to contain the data and statistics calculated for the retained 
signatures. 

Proper response is (1) a character string of format NNNNNN.EEE 
where N is a character of the filename and E is a character 
of the e Uension name, or (2) any number of blank characters 
followed by a carriage return. The latter indicates that none 
of the signature information is to be retained. 

DESCRIPTION. This program allows the user to interactively extract signa- 
tures from raw multispectral data, analyze the data, and re- 
calculate the signature if necessary. 

The user is asked to supply a general signature extraction 
region (a subset of the entire set of input data), which will be 
stored on disk. This region should be the region most fre- 
quently accessed in the calculation of the various signatures, 
because data can be accessed very quickly from the disk. If 
data for signature calculation are requested from outside 
this region, the data will be retrieved from the input CCT. 
but the time required will be much larger than the time 
necessary for comparable disk accessing. 

Once the data have been accessed, they are passed to a routine 
STAT for actual signature statistics calculation. The values 
returned to the calling program include: median vector, mean 
vector, standard deviation vector, covariance matrix, corre- 
lation matrix, upper and lower editing bounds, number of 
rejected values (based on editing criteria), eigenvalue vector, 
and the eigenvector matrix. These values are output to a hard- 
copy output device (either the console keyboard or the linc- 
printer can be assigned via the DOS Monitor ASSIGN command - 
the keyboard is the default device). 

The actual raw data used to calculate the signature are then 
classified using the MIDAS hardware, with the signature 
calculrted as the only signature. The results of this test of 
signature reliability are also printed out by the assigned 
hard-copy output. 


31 


i 


t 


i 


i 


FORMERLY WILLOW HUN LABORATORIES THE UNIVERSITY Of MICHIGAN 



Once all the signatures to be extracted have been assembled, 
they may then be stored in their final form in a disK fne spec- 
ified by the user in request #6, (see Appendix B). 

SUBROUTINES REQUIRED: STAT 

(plus all subroutines required by STAT) 


3.2.4 STAT 

TITLE: STAT - Statistical Analysis 

PURPOSE: This FORTRAN subroutine accepts a matrix of data values, edits 
out the extreme points (optional), and calculates the mean for 
each channel, the standard deviation for each channel, the median, 
covariance matrix, and the correlation matrix. If desired, the 


routine will calculate the eigenvectors and eigenvalues for the 


covariance matrix. Basically, this routine performs the actual 
signature extraction from a given set of data values. (See Fig. 8.) 


CALLING SEQUENCES: 
ASSEMBLER: 


JSR 

R5, STAT 

BR 

.^46 

.WORD 

NCHAN 

.WORD 

NSS 

.WORD 

DAT\ 

.WORD 

EDII 

.WORD 

AMED 

.WORD 

AMEAN 

.WORD 

STD 

.WORD 

COV 

.WORD 

COR 

.WORD 

EDHI 

.WORD 

EDLO 

.WORD 

NREJ 

.WORD 

NREJT 

.WORD 

IGENUE 

.WORD 

IGENTR 

.WORD 

QDEV 

.WORD 

WORK 

.WORD 

ISW 
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FORTRAN: CALL STAT (NCHAN, NSS, DATA, EDII, AMED, AMEAN, STD, 
COV, COR, EDHI, EDLO, NREJ, NREJT, 1GENUE, IGENTR, QDEV, 
WORK, ISW). 

PARAMETERS: 

NCHAN - This is the address of the integer number of channels in the 
DATA array. (Input) 

NSS - This is the address of the integer number of data values in each 
channel of the DATA array. (Input) 

DATA - This is the address of the FORTRAN data point array containing 
the data values packed two integer values per word. The array is dimen- 
sioned NCHAN + NSS and is stored in column-by-column form. (Input) 

EDII - This is the address of the floating point, single precision edit 
switch. If EDII <1, then the data in the DATA array is to be edited. 
Otherwise, no editing is performed, (Input) 

AMED - This is the address of a vector of the median for each channel 
as a single-precision, floating-point number. (Output) 

AMEAN - This is the address of a vector of the mean for each channel 
as a single-precision, floating-point number. (Output) 

STD - This is the address of a vector of the standard deviation for each 
channel as a double-precision, floating-point number. (Output) 

COV - This is the address of the FORTRAN array containing the covari- 
ance matrix for the DATA array. It is stored in column-by- column form. 
The elements of the matrix are double-precision, floating-point numbers. 
(Output). There are NCHAN elements in any row or column. 

COR - This is the address of the FORTRAN array containing the corre- 
lation matrix. Each element is a double-precision, floating-point number. 
The matrix is stored in column-by-column form with NCHAN elements 
in any row or column. (Output) 

EDHI - This is the address of the vector of the upper limit for all data 
values in each channel. Each element is a single-precision, floating- 
point number. (Output) 
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EDLO - This is the address of the vector of the lower limit for all data 
values in each channel. Each element is a single-precision, floating 
point number. (Output) 

NREJ - This is the address of the vector of the number of rejected data 
values in each channel. Each element is an integer. (Output) 

NRE fT - This is the address of the integer total number of data points 
rejected. (A data point is comprised of NCHAN data values, one data 
value in each channel.) (Output) 

IGENUE - This is the address of the vector of the eigenvalue for each 
channel. Each element is a double-precision, floating-point number. 
(Output) 

IGENTR - This is the address of the FORTRAN array of eigenvectors. 
The eigenvectors are in the columns of the matrix which is stored in 
column-by-column form. The matrix is dimensioned NCHAN*NCHAN. 
Each element is a double-precision, floating-point number. (Output) 

QDEV - This is the address of the vector of NCHAN single-precision, 
floating-point elements representing the estimate used by EDIT for the 
standard deviation in each channel. (Output) 

WORK - This is the address of the double-precision, floating-point 
vector that will be used as work space by the FIGEN subroutine. 

There should be NCHAN elements available. (Input) 

ISW - This is the address of a switch indicating, upon the call to the 
program, whether eigenvectors and eigenvalues are to be calculated. 

If ISW = -1, no eigenvectors or eigenvalues will be calculated. Upon 
return from this program, ISW indicates the following. 

If ISW = -1, eigenvalues and eigenvectors were not calculated. 

= 0, eigenvalue and eigenvector calculations were successful. 
= 1, eigenvalue and eigenvector calculations were not suc- 
cessful. 

= 2, editing procedure edited out all data points except 
perhaps one. 

DESCRIPTION: If editing of the data base is requested (EDII < 1), the upper 
and lower bounds for data values in each channel are chosen 
so that no more than one out of a thousand are rejected. 
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These editing bounds are based on the median (as an estimate 
of the mean) and the quartile value (as an estimate of the 
standard deviation). The actual standard deviation used for 
each channel appears in QDEV, and the upper and lower editing 
limits for each channel appear in EDHI and EDLO respectively. 
After the editing procedure, all the data points that were within 
the editing limits in all channels are the first NSS-NREJT data 
values in each channel. The remaining data values in each 
channel were rejected because at least one of the corresponding 
data values in another channel was not within the editing Hmits. 
If fewer than two points are considered good after the editing 
procedure, ISW is set to 2 and the subroutine returns. If there 
are two or more good points after the editing procedure, the 
mean and median are determined for each channel. Then the 
covariance matrix is calculated using the following equation: 


COV(K, L) 


NESS 

NESS-1" 


1 


NESS 


NESS 

1=1 


DATA(K,I)*DATA(L,I) 


- MEAN(K)*MEAN(L) 


where COV = floating point, double -precision covariance matrix 
NESS = the integer number of good data points 
DATA = the integer array of data values 

MEAN = the floating point, single-precision vector of the mean for 
each channel 

The standard deviation for each channel is calculated by extracting 
the square root of the diagonal elements of the covariance matrix. 
The correlation matrix is determined next through use of the follow- 
ing equation: 


COR(K,L) 


COV(K,L) 

STD(K)*STD(L) 


if STD(K)*STD(L) = 0, COR(K,L) = 1. 
where COR = correlation matrix; the diagonal elements will be all l f s and 
all off-diagonal elements are between 0 and 1 
COV = covariance matrix 

STD = vector of the standard deviation for each channel 
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If the user has requested the eigenvectors and eigenvalues for the 
covariance matrix (ISW * -1), these are calculated and the subroutine 
returns control to the calling program. 

ERRORS POSSIBLE: All possible error returns are described under the 
parameter definition for ISW. 

SUBROUTINES REQUIRED: 

EDIT - This routine removes all data pints that have a value outside 
certain limits in any channel. 

SEARCH - This routine is used to calculate the median data value for 
each channel. 

LTI * This unpacks the data values from the data matrix. 

EIGEN - This routine calculates the eigenvectors and eigenvalues of 
thi covariance matrix. 

FTNLIB - This library contains the FORTRAN utility subroutines, 
STORAGE REQUIREMENTS - 2552 octal locations 


3.2.5 SCALER 
TITLE: 


SCALER 


CSECT: SCALE 

PURPOSE: This program calculates all the coefficients that need to be loaded 
into the MIDAS memory. The coefficients are calculated, scaled, 
and shifted so that MIDAS will receive the greatest number of 
significant bits. These coefficients are then written into a file 
with a directory so that the MIDAS memory loading program 
(RAMLD) can select the set of signature coefficients it requires. 

(See Fig. 9.) 

CALLING SEQUENCES: None. This is a main program. All information needed 

comes in response to typed requests on the 
keyboard. 

1. PLEASE TELL CSil YOUR OUTPUT FILE AND YOUR 
INPUT FILE(S) 

This requests the user to type the name of the file where the 
scaled signature coefficients are to be put and the name of the 
file(s) where the signature statistics to be scaled are stored. 

In addition, the user must specify a switch telling th scaler 
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Put Determinant in 
Output Buffer and 
Write Output Buffer 
into Scaled File 


< 3 - 


Set Up to Read Data 
for Next Signature 



FIGURE 9. FLOWCHART FOR SCALER (Concluded) 
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what to do when nonfatal errors are encountered while scaling. 

The format of the response should be acceptable to the command 
string interpreter. 

DV: OFILENAME. EXT [UIC] <DV: IFILENAME. EXT [UIC] /SW 

where DV: = the device where the following file is stored. If not 

specified, it defaults to the disk. 

OFILENAME. = the six -character name of the output file. 

IFILENAME. = the six-character name of the input file. These files 
must be contiguous files with at least two 256-word 
blocks. The filename is followed by a period. 

EXT = the three-character extension name for the preceding 
filename. 

[UIC] = the sign-on ID (surrounded by brackets) under which 
the file was created. May be left out if the same ID 
as the user's. 

/SW = the two-character switch preceded by a slash which 

tells the scaler what action to take when nonfatal errors 
are encountered while scaling the input file(s). 

/SW=/CO = tells the scaler to continue processing whenever a non- 
fatal error is encountered. The directory entry for 
the bad signature is removed and the next signature 
statistics are obtained. 

/SW=/ST = tells the scaler to stop processing whenever a non- 
fatal error is encountered. The directory entry is re- 
moved for the bad signature. 

2. SCALING COMPLETED 

This indicates that all of the input files have been scaled and the directory 
has been written out for the scaled file. 

Algorithmic Description: 

Input Format 

The format for each input file is as follows (disk file assumed): 

BLOCK 1 - 

WORD 1 - integer # of signatures in the file 

WORD 2 - integer # of channels. This must be the same for all 

signatures in all files being scaled at one time (NCHAN). 

WORD 3 - integer # of words per signature 
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INTEGER 


WORD 3 + 255 
256 


- # of blocks per signature in input file 


BLOCK 2. - 

WORD 1-4-8 ASCII character signature ID 
WORD 5 - (W-l) - vector of signature means with NCHAN 
double-precision, floating point elements 
WORD W- - the upper triangle of the symmetric, square covar- 
iance matrix. Each element should be a double - 
precision, floating point number. There must be 

+ - • NCHAN elements. This format is 
repeated with each signature file. 

where W = NCHAN* 4 + 5 


PROCESSING: 

MEANS: 

The means used by MIDAS must be negated and scaled so that no 
more than 8 bits in two’s complement notation are used. That is, 
the high-order bit (bit 7) is used as a sign. Therefore, the smallest 
number possible is: 

10000000 base 2 - -128^ 

and the largest number possible is: 

01111111 base 2 = +127^ 

The numbers coming from the signature file are in offset binary no- 
tation with the smallest integer number represented by 
00000000 base 2 = 0^ 

and the largest integer number represented by 
11111111 base 2 = 255^ 

The incoming double-precision, floating-point mean is negated. The 
constant 128^ is then added to the result to convert it to two’s com- 
plement notation. This number is converted to integer and put in the 
output buffer. If an input mean is * 256, an error message is printed 

VARIANCES: 

The standard deviation for each channel is determined by calculating 
the square roots of the diagonal elements of the covariance matrix. 
The standard deviation is inverted, scaled, and converted to integer. 
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MIDAS presently has only 12 bits available for this number. There- 
fore, to obtain as many significant bits as possible, the standard de- 
viations are subjected to the following restriction: 


1 < a 

With this restriction, MAX [1/a] =1.0-2 

0.111111111111 


-12 


In binary this is: 


This provides the maximum number of significant bits with no bits 
constantly 0 or constantly 1. If a standard deviation is less than 1, 
the standard deviation is altered to make it just enough greater than 
1 so that 

i=1.0-2' 12 

Consequently, the processing procedure is as follows: 

1. A diagonal element of the covariance matrix is found, and its square 
root is calculated. If no more, quit processing. 

2. If the result is greater than 1, go to step 3. Otherwise, multiply 
the diagonal element by 

1 


(1 - 2' 12 )*ct 


Then multiply the column of the upper diagonal covariance matrix 
(except the diagonal element itself) by 

1 


(1 - 2* 12 )*ff 


Then go back to step 1 and select the same diagonal element again. 
If a = 0, a is set equal to 1 


3. Multiply this number by 2 


1 - 2 
12 




I * 2 12 

CT 

4. Convert the result to integer and save it. 

5. Go to step 1. 

See write-up on STDDEV. 
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RAM COEFFICIENTS: 

To understand the reasons behind all the following mathematical 
gyrations, see the mathematical analysis later in this section 
(see also Section 4). The processing goes as follows: 

1. Calculate the correlation matrix from the covariance matrix. 

2. Transpose the correlation matrix. This matrix must be trans- 
posed because the next routine to use it was written in FORTRAN 
and expects all arrays to be stored in column- by -column form 
instead of row- by -row form. 

3. Calculate the eigenvalues and eigenvectors using the FORTRAN 
subroutine EIGEN. 

4. Transpose the FORTRAN-form eigenvectors’ matrix. 

5. Treat the eigenvalues as the diagonal element of a square matrix 
with all off-diagonal elements set to zero. 

6. Calculate the square root of each diagonal element and invert it. 

7. Multiply this transformed matrix by the eigenvectors’ matrix and 
store the results back in the eigenvectors’ matrix. 

8. Find the largest element in absolute value in the new matrix, 
negate it, divide every element of the new matrix by it, and 
save the constant. 

9. Multiply the new matrix by 128 and convert it to integer. The low- 
order 8 bits are the only significant bits that MIDAS can handle, 

10. Take the constant saved previously, square it, and store it in the 
output buffer following the coefficient matrix (stored in row-by- 
row form). 

See RAMCOF writeup for a more detailed explanation of RAM 
Coefficients. 

DETERMINANT: 

This program does no scaling on the determinant. The determinant 
is found for the covariance matrix by performing a Gauss-Jordan 
inversion with partial pivotal elimination. The double-precision, 
floating point natural logarithm is calculated and stored in the output 
buffer. 

ERRORS: (See CSI parameter /SW above) 

1. ERR1,1. INPUT FILE IS A LINKED FILE. TRY AGAIN. 

All files used by the scaler must be contiguous files. 

2. ERR1.2. INPUT FILE DOESN’T EXIST. TRY AGAIN. 
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3. ERR2. NCHAN £ 0 

NCHAN must be a positive integer number greater than 0. [FATAL] 

4. ERR3. NCHAN NOT THE SAME FOR ALL SIG FILES 
NCHAN was difierent for two input signature file header blocks. 
[FATAL] 

5. ERR4. NUMBER OF SIGNATURES £ 0 

The integer number of signatures entered in a signature file header 
block has a value £ 0. [FATAL] 

6. ERRS. MORE THAN 50 SIGNATURES FOR SCALED FILE. 

The program can put only 50 scaled signatures or less in the scaled 
file because only one block is allowed for a directory. The header 
information for 50 signatures takes up the entire directory block. 
The scaled file is complete to and including the 50th scaled signa- 
ture. [FATAL] 

7. ERR6. MEAN * 256. 

Mean values must not be greater than or equal to 256. [NONFATAL] 

8. ERR7. VARIANCE 2 1.0. 

The value of l/(a) was too large for MIDAS. The number has been 
scaled again and the covariance matrix has been altered. Process- 
ing always continues. This is just a warning. 

9. ERR8. EIGEN SUBROUTINE DIDN’T CONVERGE. 

The routine for calculating the eigenvalues and eigenvectors of the 
correlation matrix was unable to process the data given it. 
[NONFATAL] 

10. ERR9. EIGENVALUE < 0 

If a negative eigenvalue is found, then the original correlation 
matrix was not positive definite. This might indicate a data input 
error. [NONFATAL] 

11. ERR10. COVARIANCE MATRIX IS SINGULAR. 

The determinant of the covariance matrix was found to be zero or 
almost zero. Therefore, its natural logarithm cannot be calculated. 
[NONFATAL] 

12. ERR11. DETERMINANT < 0. 

The determinant of the covariance matrix was less than zero. 

This is mathematically impossible if the covariance matrix is 
positive definite. [NONFATAL] 
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13. ERR12. SCALED FILE TOO SMALL FOR REQUESTED SIGNATURES. 
The scaled signature file is not big enough for all the signatures 
requested for scaling. The signature number -1 indicates the num- 
ber of signatures actually scaled. [FATAL] 

14. ERR13,1. KEYBOARD ERROR 

There was an 10 error. The status word is printed from the line 
buffer header. [FATAL] 

15. ERR13,2. SCALED FILE TRAN BLOCK ERROR 

There was an output error. The status word is printed from the 
scaled file tran block. [FATAL] 

16. ERR13,3. SIG FILE TRAN BLOCK ERROR 

There was an input error. The status word is printed from the 
signature file tran block. [FATAL] 

17. ERR14. SIGNATURE FILE TOO SMALL FOR REQUESTED SIG- 
NATURES 

The number of signatures declared in the header block for a sig- 
nature input file was larger than the file. The signature number 
printed minus 1, is the number of the signature scaled last. 

[FATAL] 

18. ERR15. STANDARD DEVIATION = 0 

A standard deviation was found equal to zero making it impossible 
to calculate a complete correlation matrix. [NONFATAL] 

19. SIGNATURE 3 = XXXXX 

Signature number XXXXX is the number of the signature currently 
being processed. This is relative to the first signature of the first 
input file. This is printed after an error message. 

EXTERNAL CONSTANTS DEFINED: (all double-precision floating point numbers): 
DFLT0 - 0. 

DFLT1 - 1. 

DFLT4 - 4. 

DFLT8 - 8. 

DFLT2 - 2. 

DFLT27 - 128. 

DFLT29 - 512. 

DFLT99 - -1.0E20 
DFLT18 - 0.125 
DFLT97 - 32768. 

DFLT96 - -2.44140625E-04 
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SUBROUTINES NEEDED: 

STDDEV - this calculates the standard deviations and the 1/a vector 
EiGEN - this calculates eigenvectors and eigenvalues 
RAM CO F - this calculates the coefficient matrix to be loaded into the 
MIDAS RAMs 

INVERS - this routine inverts a matrix and calculates the determinant 
of the original matrix at the same time 
SEEKFL - this routine finds the starting block # for any given filename. 

It also indicates whether the file is contiguous or linked. 
DSUB - calculates the floating point, double precision displacement 
of any element in a square matrix 

DSUB2 - calculates the floating point, double precision displacement 
of any element in an upper triangular matrix 
CORREL - calculates the correlation matrix from a given covariance 
matrix and standard deviation vector 

DEVICES REQUIRED: KEYBOARD input 
KEYBOARD output 

fDisk signature file input 
If not specified to CSI < 

[Disk scaled file output 

MACROS REQUIRED: .INIT, .WRITE. .TRAN. .CSI1, .CSI2, .WAIT, .RLSE, 
.EXIT, .READ, .BIN2D, .BIN20 

STORAGE REQUIREMENTS: 21322 octal locations 

M athematica l An alysis . This system performs a maximum likelihood decision 
and assumes a multimodal Gaussian multivariate dis- 
tribution. The function computed in the classifier is 

pr(X) = - K 72 ~T / 2 EXP l -1/2 (X-;i) T 

2 7T 1*1 

where X = input data vector 
4 = mean vector 
= variance - covariance matrix 
1*1 = determinant of ti 

Taking the natural logarithm of both sides 

fn [pr(x)j = -l/2(X-u) T <f '(X-n) + fn irtf 1/2 + Cn( 2 ff ' N/2 ) 
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The decision rule is 
pr(X ) 

— rz^\ if ^1 choose B; -1 choose A 
pr( Xg) 


or 


fn [pr(X A )| < fn [pr(X 0 )j choose B 


Substituting on both sides yields 


-i • pfc' * <* h 

-i/*x-n B ) T ’<g(x-B B ) . — £f4 • - af 1 

( LjZ ) 


(n y 


-1/2 


- 1/2 


Choose B. 


After multiplying through by -2 

(X -' i A )T ' , A (X ’' i A ) + ( TaI < lX-M B ) T rfB(X-|i B )+ (n|-^ B | 


The covariance matrices and ^ are both available, but the values 
within the matrices vary over a wide range. This is particularly a prob- 
lem with MIDAS because of its limited bit capacity. It would lie desirable 
to compress the same information from the covariance matrix into a 
matrix whose values did not vary so widely and would require a minimum 
bit capacity to represent all but the extreme points of the matrix's full 
range. 

This covariance matrix scaling can be achieved by use of the following 
substitution. 

I'M * KpiM 

where o = diagonal matrix of standard deviations 

p = correlation matrix with l’s on the uiagonai and numbers be- 
tween 0 and 1 off the d 1 ^gona* 

Inverting •) yields 


T| 

r 

- 1 

T 

M 

p \ 

i 

La 


Substituting into fn[pr(X)j 


(X-/i A ) 1 (X-4 ) 

- + fn |*». 


(X-4 b )‘ 


-1 * X 'P 


B 


B 


fn !"b! 


47 



2jp 


rORMERL* WILLOW RUN LABORATORIES THE UNIVERSITY OF MICHIGAN 


The term still has a wide range, but the following coarse restriction 
will eliminate those X values too far from the mean to be considered for 
classification. 

This equation can be simplified further by decomposing p *. This de- 
composition is achieved through eigenvalue analysis. The solution cf the 
following equation produces the eigenvalues and their associated eigen- 
vectors. 

px = xx U) 

where X = an eigenvalue 
X = an eigenvector 

also 

p T Y = XY 

where Y = an eigenvector 

X remains the same because the characteristic equation remains 
the same. 

For example 

det [p T - XI] = det [p - M] 

The set of X vectors and Y vectors that satisfy the relations in Eq. 1 are 
called the U and V eigenvectors’ matrices respectively. 

U is associated with p 

T 

V is associated with p 

T 

X is the same for p and p 

The individual eigenvectors appear in the columns of both matrices. Since 
p is a symmetric matrix about its diagonal in our case, then U and V are 
the same matrix. Throughout the following analysis, V may be replaced 
by U with no change to the result. 

It can be proven that 

U T V = 1 and V T U = I (2) 

Let us create a new matrix 

A = XI 
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where I = the identity matrix with Vs on its diagonal and zeros every- 
where else. 

A = a diagonal matrix with the eigenvalues on its diagonal and 
zeros everywhere else. 

It can also be proven that 

pU = UA 

p T V = VA 

A T = A 

therefore, from Eq, (3) 

T T 

pUV = UAV 

From Eq. (2) this dissolves into 


p = uav a 

Also, from Eq. (4) 

p T VU T = vau t 

From Eq. (2) this dissolves into, 

p T = vau t 

Rewriting Eq. (1) 

pX = >X x> pU = A 

p‘*pU = p ' 1 AU 
U =p _1 AU 
p^U = A _1 U 

Therefore, U appears to be the eigenvectors matrix for p 1 as well as p. 
The only change is that the eigenvalues for p * are the reciprocal of the 
eigenvalues for p. Finally, the result appears for which we have all 
been waiting. 

From Eq. (5), 

p-'.UA-V 
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and from Eq. (6), 


[p 1 )' 1 = VA^U 1, (8) 

Since A is a diagonal matrix, the inverse of A is simply the reciprocal of 
each diagonal element. Also 


k -l 


= A‘ 1/2 x A 


• 1/2 


Therefore, from Eq. (7), 

. DA' 1 ' -A^V 

and from Eq. (8), 

(P T ]-' . VA- 1/2 * A->/V 

Substituting Eq. (9) back Into £n [pr(X3] and replacing the V matrix with 
the U matrix (because U and V are identical) yields 


(9) 

(10) 


rx-u 


U.A-^xA- 1 ^ 
A 


<e 


-M 


nT 


B 


U„A->/ 2 X A-’/ 2 
13 


L’bJ 

The only difference between 
IT 


•m 
■$?] 


+ £n 0 


+ fn 0. 


Bl 


and 


. 0 J 


,- 1/2 


UA 


• 1/2 




is that the former yields a row vector and the latter yields a column vec- 
tor. The respective elements are identical. Therefore, the same result 
can be achieved by calculating 

A-i/Vjkaj 


The vector elements can then be squared. The sum of these squares yields 
the same result as performing the matrix multiplications indicated on 
both sides. 
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Therefore, all the scaling program has to do is calculate the following: 

-/i a vector of negated means 

- = a vector of variances 
T 

U the transpose of the eigenvector matrix (the eigen- 
vectors end up in the rows, assuming they were 
originally in the columns) 

- 1/2 

A = a diagonal matrix with the square root of the reciprocal 
of each eigenvalue on its diagonal with zeros every- 
where else. 

-1/2 T 

A U = the matrix of coefficients to be put into the memory 
of the MIDAS Classifier 

For a more complete description of the eigenvalue analysis applied here, 
see Ref. [3]. 

3.2.6 RAMLD 

TITLE: RAMLD - COEFFICIENT LOADER 

CSECT: None 

PURPOSE: This program allows the user to select a set of signatures from 
a disk file (generated by the SCALER program) and load the ap- 
propriate coefficients into the random-access memory (RAM) of 
the MIDAS Classifier hardware in preparation for classification 
of multispectral data. (See Fig. 10.) 

CALLING SEQUENCE: None 

PARAMETERS: Operating parameters are input via the console keyboard in 
response to the following requests. 

1. ENTER INPUT SIGNATURE FILENAME = 

This requests the filename and extension for the file to be 
used as input. This input file must be created by the SCALER 
program to insure proper data format. The proper response 
format is ’NNNNNN.EEE' where N represents a character 
in the filename (blanks are significant characters) and E 
represents a character in the extension. (See Figs. 11, 12, 
and 14.) 

2. SIGNATURE NOS. TO BE USED = 

This requests the selection of a subset of signatures from 
those present in the file specified by the response to Request 1. 
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FIGURE 10. FLOWCHART FOR RAMLD (Concluded) 
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$ L0 60, 40 
t»RT E : -22-FEB-74 
TIME -00. 03 21 
> RUN RAHLD 


CLASSIFIER COEFFICIENT LOADER ** V. 1. 0 ** MFG ** OCT IS >77 
FNTER INPUT SIGNATURE FILENAME * VNP . SCL 


SIG NO 

ID 

00001 

LODGE D1 

00002 

LODGE D2 

00003 

SP FIR 

00004 

GR 3 

00005 

LT RK 3 

00006 

GR 2 

00007 

LT RK. 2 

00808 

LO SHRUB 

00009 

GRLANDS 

00010 

GR/BR 

03011 

BRUSH 

00012 

DK RK 

00013 

LT RK 

00014 

THERMDEP 

09015 

WATER 


SIGNATURE NOS. TO BE USED = 1, 2. 3, 4, 5 , 10, 11, 13. 15 
ENTER CHI-SQUARE VALUE (0 - 1000) = 1QOO 
$ 


FIGURE 11. COEFFICIENT LOADING OF A SUBSET OF SIGNATURES FROM FILE YNP.SCL 
Note that in a computer printout, user-supplied responses will be underscored. 
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< RUN RAHLD 

CLASSIFIER COEFFICIENT LOADER ♦ * V. i. 0 ** MFG ** OCT 1973 
ENTER INPUT SIGNATURE FILENAME = VNP . SCL 


SIG NO. 

ID 

00001 

LODGE D1 

00002 

LODGE D2 

00003 

SP FIR 

00004 

GR 3 

00005 

LT RK 3 

00006 

GR 2 

00007 

LT RK 2 

00008 

LO SHRUB 

00009 

GRLANDS 

00010 

GR/BR 

00011 

BRUSH 

00012 

DK RK 

00013 

LT RK 

00014 

THERMDEP 

00015 

WATER 


SIGNATURE NOS. TO BE USED = ALL 
ENTER CHI-SQUARE VALUE <0 - 1000) = 1£» 
$ 


FIGURE 12. COEFFICIENT LOADING OF ENTIRE SET OF SIGNATURES FROM FILE YNP.SCL 

USING "ALL” FEATURE 

Note that in a computer printout, user-supplied responses will be underscored. 


55 


\ 


—.J. 






} 


T 


T 





1 


1 


1 





formerly willow run laboratories the university of Michigan 


Input is made via a call to the subroutine GETARG, and thus 
the input string must follow the format necessary for input 
to the GETARG routine (see write-up on GETARG routine). 

This input will be a set of decimal-numeric strings (sepa- 
rated by commas) specifying the signatures to be loaded. 

(See Fig. 11.) If al! of the signatures are to be loaded, a prop- 
er response to .mis request would be the 3-character string 
’ALL.* (See Fig. 12.) 

3. ENTER CHI-SQUARE VALUE (0-1000) = 

This requests the input of a value for the chi-square test of 
the exponent in the classifier. A proper response to this 
request is a decimal number in the range 0 to 1000 of 1 to 4 
characters. 

DESCRIPTION: This program is designed to use as input a signature file cre- 
ated by the SCALER program, such that a set of 4- or 8 -channel 
signature coefficients are loaded into the MIDAS Classifier 
hardware RAMs. 

There are five sets of coefficients to be loaded into MIDAS 
for each signature. (See Fig. 13.) (1) a mean vector \i, 

(2) a (1/a) pseudo -variance vector V, (3) a square eigen- 
vector matrix U, (4) a normalization coefficient K2, and (5) an 
additive constant LN, corresponding to the natural logarithm 
of the determinant of the matrix U. 

The functional flow of this program can be obtained from the 
accompanying flowchart. Certain aspects, however, will now 
be described in more detail. 

(1) The normalization of the K2 values is performed by 
the subroutine NORMAL. This routine divides each 
K2 element by the maximum of all the K2 elements 
and then converts this floating-point value to a 12-bit 
unsigned integer, such that 0.0 corresponds to integer 
0 and 1.0 corresponds to integer 7777^. These 12-bit 
integers are passed to the RAMs of the MIDAS 
hardware. 

(2) The translation of the LN values is performed by sub- 
tracting the minimum LN from all of the LN values. 
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FIGURE 13. FLOWCHART FOR LOADER 
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(3) The addressing scheme which is used to pass co- 
efficients to the proper RAMs for MIDAS is described 
in Appendix A. 

ERRORS: The following error messages are possible: 

(1) ♦♦♦ERROR - INPUT FILENAME NON-EXISTENT 

This error is caused by a non-existent filename being entered as a 
response to Request 1. Check the format of response to Request 1 
or use PIP* to check for proper filename. 

(2) ♦♦♦ERROR - INPUT FILE LINKED 

Since the output from SCALER is always a contiguous file, the file- 
name specified in response to Request 1 must also be contiguous. 

(3) ♦♦♦ERROR - SIGNATURE NO. TOO LARGE 

In the response to Request 2, a signature number greater than the 
number of signatures available from the input file was entered 
(Fig. 14). Signature numbers will have to be reentered. 

(4) ♦♦♦ERROR - ILLEGAL CHARACTER IN STRING 

In the response to Request 2, the format necessary for input to the 
routine GETARG was not followed. Signature numbers will have 
to be reentered. 

(5) ♦♦♦ERROR - NO. OF SIGNATURES > 16. 

More than 16 signatures were selected to be loaded into the MIDAS 
RAMs (Fig. 14). In the 4-channel case, 16 is the maximum number 
of signatures; whereas, in the 8-channel case, only 8 signatures 
may be loaded. 

(6) ♦♦♦ERROR IN DR-UB-DRST IN SWR 

A hardware malfunction occurred in the DR-11 B used to transfer 
data to the MIDAS hardware. The DR-11 B status register is in the 
console display register. Diagnostics on the MIDAS and DR-11B 
hardware should be initiated. 

STORAGE REQUIREMENTS: 10376 8 bytes 

SUBROUTINES NEEDED: GETARG 

SEEKFL 

NORMAL 


*PERIPHERAL INTERCHANGE PROGRAM - DOS system program for 
file manipulation and maintenance. 
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f RU RAHLD 

CLASSIFIER COEFFICIENT LOADER ** V. 1. 0 ** MFG ** OCT 1977 
ENTER INPUT SIGNATURE FILENAME = VNP . SQL 


SIG NO. 

ID 

00001 

LODGE D1 

00002 

LODGE D2 

00002 

SP FIR 

00004 

GR 3 

00005 

LT RK 2 

00006 

GR 2 

0000? 

LT RK 2 

00006 

LO SHRUB 

00009 

GRLAND5 

00010 

GR/BR 

00011 

BRUSH 

0001? 

DK RK 

00013 

LT RK 

00014 

THERMDEP 

00015 

WATER 


SIGNATURE NOS. TO BE 'JSED = 1, 2, 3, 4, 5, 16, 10, 11 
*** ERROR - SIGNATURE NO. TOO LARGE 

SIGNATURE NOS. TO BE USED * 1 , 2, 7 . 4. 5, 6, 7, 6> 9 , 10, 11, 12 , 17 , 15, 1, 2 
*** ERROR - TOO MANV SIGNATURE NOS. 

SIGNATURE NOS. TO BE USED * 1, 2, 7 , 4, 5, 6, 7 £, 9, 10, 11 , 1 2, 13, 14, 15 
ENTER CHI -SQUARE VALUE <0 - 1000) r 100g 

$ 

FIGURE 14. COEFFICIENT LOADING WITH INPUT ERRORS FLAGGED BY PROGRAM 
Note that in a computer printout, user-supplied responses will be underscored. 
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3.2.7 CLAS1 AND CLAS2D 

TITLE: CLAS1 - Pass 1. Classification 

CSECT: None 

PURPOSE: This main program establishes the parameters of the classification 
run to be made, performs the system setup necessary, creates an 
intermediate disk file describing the system setup performed, and 
turns control over to the module CLAS2D for the actual classifi- 
cation run. (See Figs. 15. 16.) 

CALLING SEQUENCE: None 

PARAMETERS: Operating parameters are input via the console keyboard in 

response to the following requests: 

1. IS OUTPUT TO DISK DESIRED ? 

This question establishes whether an output file of the classi- 
fication results is to be created on the disk during the classi- 
fication run. The only proper negative request is ’NO’: any 
other response is assumed to be affirmative. If the response 
is negative, Requests #2 and #7 (below) will not be made. 

2. ENTER OUTPUT FILENAME = 

This request establishes the name of thi file to be created 
to contain the results of the classification. Proper format 
for the response is NNNNNN.E EE where N is a character in 
the filename proper, and E is a character in the file's ex- 
tension. 

3. ENTER TYPE OF CLASSIFY INPUT - 

This request establishes the type of input which will be fed 
through the classifier. System setup is performed depend- 
ing on the type of input mode. Proper responses are either 
'DIGITAL' or 'ANALOG/ with the first character sufficing 
for either. NOTE: In Phase 1, the analog data input mode 
has not yet been implemented: thus the digital data mode is 
assumed, after printing out the warning: 

ANALOG CLASSIFY NOT IMPLEMENTED ** 8-OCT-73 
DIGITAL INPUT ASSUMED 

4. MOUNT DIGITAL INPUT TAPE ON UNIT 0 (9-TRACK) 
ENTER < CR> WHEN INPUT READY 
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FIGURE 15. FLOWCHART FOR CLAS1 (Continued) 





FIGURE 15. FLOWCHART FOR CLAS1 (Concluded) 
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FIGURE 16. FLOWCHART FOR CLAS2D (Concluded) 
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This request informs the operator that the input tape should 
be mounted on magtape unit 0, on a 9 -track drive. Proper 
response when the tapes are ready is any string of char- 
acters followed by a carriage return (no preceding char- 
acters are necessary). If no drive 0 has been established, 
an 'A002 XXXXX' error warning message will be printed out 
on the console. When the error condition has been corrected 
(i.e.. the input tape mounted properly on unit 0), the proper 
response to continue is CO’ followed by a carriage return. 

5. ENTER SUBTITLE = 

This request establishes a subtitle or secondary descriptor 
for the particular data-set to be classified in the upcoming 
run. Proper response is a string of up to 72 characters ter- 
minated by a carriage return. 

6. ENTER LINE AND POINT NOS. ^ 

This request establishes the rectangular boundaries of the 
scene to be classified. This scene is usually some subset of 
the entire input data set. Proper response to this request 
is a sequence of six numbers separated by commas, where 
the six numbers represent the following: 

NS A - the starting line (record) number 
NSB - the ending line (record) number 
KS - the line (record) increment 
NA - the starting pLxel number 
NB - the ending pixel number 
KP - the pixel increment 

For example, an input string of 100, 1000, 10, 150. 800, 

3 would indicate a subset of data starting with the 100th 
data record on the tape and ending with the 1000th data 
record: reading only every 10th record, then, the records 
used would be 100, 110, 120 980, 990, 1000. Fur- 

thermore, for each data record, the pixels (resolution 
elements) used, start with number 150 and go througn 798, 

i.e., 150, 153. 156, 159 792, 795, 798. Note that 800 

was not used because, according to the increment KP, point 
801 would be the next logical pixel — but this is greater 
than the upper point limit of 800. 
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If an improper response is given, no error message is printed 
out; however, the request line is reprinted on the console, 
and proper input should be made. Improper inputs consist 
of other than six numbers in the string or a non- numeric 
character other than a comma. 

7. EXPONENT TO DISK DESIRED ? 

This request establishes whether or not the exponent calcu- 
lated for each pixel classification is to be stored along with 
the actual 5-bit classification result. If so, a full word (16 
bits) will be stored on disk per pixel input and, therefore, 
classified; the high-order 11 bits will be the exponent value, 
whereas the low-order 5 bits will be the classification result. 

Otherwise, if the exponent is not desired, classification re- 
sults will be packed 2 to a word, with bits 0-4 representing 
the classification result for pixel n and bits 8-12 represent- 
ing the classification result for pixel n+1. 

Proper responses are ’YES’ or ’NO' with the first character 
sufficing for either. A negative response is assumed if a 
response other than the two above (or an allowed abbreviation) 
is made. 

ERRORS: An error is possible if output of classification results to disk stor- 
age is desired. If the amount of contiguous disk storage necessary 
to store the entire scene selected by the line and point number input 
is not available, a warning ’FILE STORAGE LIMITATION - REDUCE 
SCENE’ will be printed on the console. A branch will be effected to 
the point in the program where request #6 is made. At this point, 

one may either reduce the size of the data-set by reducing the line j 

and point numbers, or by not storing the exponent value. j 

DESCRIPTION: This program is used to establish the classification parameters \ 

of the run to be made using the MIDAS hardware. No actual * 

hardware initializing is performed in the program; however a 
temporary disk file is created containing the descriptive pa- 
rameters entered from the console by the operator. After these 
parameters have been placed in the disk file 'currently always 
named D0000X.999), the moJule CLAS2D (or CLAS2A when 
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analog classification is fully implemented) is loaded into mem- 
ory and executed, thus replacing CLAS1 as the controlling 
program. 

STORAGE REQUIREMENTS: 3350 g bytes 

SUBROUTINES NEEDED: GETFIL 

GETARG 

TITLE: CLAS2D - Pass 2: Classification (Digital) 

CSECT: None 

PURPOSE: This program, using system descriptive information obtained 
from a disk file created by CLAS1. performs actual hardware 
setup, initiates the classification, and controls 10 functions 
on a line-by-line basis for both input and output of data. This 
includes data transfers directly to the MIDAS Classifier either 
from or to memory, and also the transfer of data to memory 
from magnetic tape and transfer of data (classification results) 
to the disk. 

CALLING SEQUENCE: None 

PARAMETERS: General system descriptors obtained previously by CLAS1 

are stored in a file D0000X.999, which is used only for 
transmission of information between CLAS1 and CLAS2D. 

In addition, if disk output of classification results was re- 
quested during CLAS1, additional verification of disk output 
is requested by the following: 

IS OUTPUT TO DISK DESIRED ? 

Proper response to this request is either ’YES' or ’NO’ with 
the first character sufficing in either case. An affirmative 
response is assumed if neither of the two proper responses 
(or an allowed abbreviation) is given in response to the 
request. 

DESCRIPTION: This program provides all of the control of data transfer to 

and from the MIDAS Classifier, and between memory and 
peripheral data storage devices, i.e., magnetic tape and 
random-access disk. 
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Control of data transfer is performed on a line-by-line 
basis; no buffering of input data is performed because the 
tape drive is moving at its maximum rate with unbuffered 
tape input. 

Data transfers to and from the MIDAS hardware are 
interrupt-driven — i.e., the conclusion of a transfer in or 
out of MIDAS via the two DR-11 B general device interfaces 
causes an interrupt through a unique hardware location for 
each data transfer. 

Output to disk is buffered into a region 9720^ 16-bit words 
long, making output to disk necessary much more infrequently 
than every line. 

Initially, the MIDAS hardware is set up by the transfer of 
various appropriate command words via the DR-11C general 
device interface. These MIDAS commands are detailed in 
Section 6.1. 

ERRORS: The following error returns are possible: 

1. FATAL DISK ERROR - RETRY COUNT = 5 DISPLAY REGISTER 
CONTAINS DRST 

This error indicates that a hardware error on the RK-05 disk drive 
makes impossible the correct transfer of data to disk. The write 
operation, though attempted 5 times, was unsuccessful. The initial 
disk block number of the attempt write operation is placed in the 
console display register for operator information. The classifica- 
tion run is then tprminated and return is made to DOS MONITOR 
command mcde via the .EXIT EMT call. Diagnostics on the RK-05 
disk drive should be initiated immediately by qualified software 
system personnel. 

2. DR-11B #n ERROR - DISPLAY CONTAINS DRST 

The contents of the appropriate DR-llB’s status register (DRST) 
are placed in the console display register for operator information. 
The classification run is then terminated and return is made to 
DOS MONITOR command mode via the .EXIT EMT call. Diagnostics 
on the appropriate DR-11 B should be initiated immediately by 
qualified software personnel. (See Fig. 17.) 
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n = 1 indicates a malfunction in the DR-11 B used to transfer input 
data to the MIDAS Classifier. 

n = 2 indicates a malfunction in the DR-11 B used to transfer classi- 
fication results back to the PDP-11/45 core memory. 

STORAGE REQUIREMENTS: 57612g bytes 

SUBROUTINES REQUIRED: COMAND 

BN 2 BCD 
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4 

SYSTEM CONFIGURATION 

4.1 APPROACH 

A special purpose computer used for recognition (classification) of remotely sensed ob- 
jects may take several forms. In general, the system design indicates that a general purpose 
computer be used for statistical analyses and for control and monitoring of the special pur- 
pose machine. The special purpose machine could be implemented using analog or digital 
techniques. 

Analog techniques for classifying objects using multispectral data have been well devel- 
oped over the past few years and, until very recently, appeared superior in cost and complex- 
ity to digital techniques in which a special purpose mechanization would be used. One of the 
mos* significant cost -components, the multiplier, was previously estimated to cost on the 
order of several hundred to a thousand dollars, and would be replicated, in the worst case, 
about 300 times. Two significant changes have taken place: (1) the cost per 8-bit multiplier 
has declined substantially, and (2) the speed of such units has increased to a range of 70 to 
200 nsec product. As a result, the multiplier can be built to be time-shared in computing 
several products per input sample (5 psec) so that a factor of cost reduction may be had 
ranging between 5 and 10. Labor costs for wiring the medium scale integration ALUs (Arith- 
metic Logic Units) rather than the small scale integration dual in-line packages (DIPs) can 
be reduced by significant factors also. The classifier in this configuration, then, has the 
same organization as the previously operated analog system (SPARC) but has its functional 
components implemented in time-shared digital circuitry. Each component acts as a computing 
element in a hard-wired sequence, which may be thought of as a pipeline or cascade of com- 
putational circuits. The desired processing rate is comparable with the SPARC system, that 

is, multispectral data are processed at a rate equal to that at which it was collected, i.e., 

5 

2 x 10 resolution elements per second. 

6 

An ERTS frame (8 x 10 resolution elements) can be classified, element by element, in 

about 40 seconds provided that the data is recorded in a form such tha' it can be supplied at a 
5 

rate of 2 x 10 pixels/second. This is, of course, not the case for CCTs (Computer Compatible 
Tapes) presently supplied which can only allow rates one -seventh to one -twentieth of those 
possible with high density tape. 

The computation this system performs is a maximum -likelihood decision, assuming a 
multimodal Gaussian multivariate distribution. This assumption has been well justified by 
more than 100 experiments using multispectral data at ERIM f 4 ] by a similar number at 
LARS and, as time goes on, by more and more experience at NASA and other agencies. Al- 
though simpler algorithms can perform well for some data-sets, a significant percentage of 
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applications demand this powerful decision rule. No penalty in speed and only a small addi- 
tional cost occurs in using this algorithm, hence it is employed. 

Thus, the basic calculation to be performed is 

fn \pr(X)} 

where X is the input data vector and the probability density function is a Gaussian density 
function: 

fn {pr(X)} - - 

The 

(X -M) T fl -1 (X -M) = o (11) 

is a quadratic calculation in which M is the mean vector for each distribution and •’ is the 
inverse of the variance -covariance matrix. This computation must be performed for each 
object cluss included in the classification process and the number of computational steps in- 
creases as the square of the number of channels. Therefore, for one MIDAS Classifier con- 
figuration, the number of channels is limited to eight, anu the number of object classes is 
limited to eight, while the same hardware switched to another configuration limits the num- 
ber of channels to four but increases the allowable number of object classes to sixteen. This 
implementation uses about 40 digital multipliers. This same size classifier, if implemented 
as a hybrid processor, would have required about 210 multiplying D/A converters. Since the 
cost of each was comparable, the choice was clearly in favor of an all-digital classifier with 
characteristics as summarized in Tables 3 and 4. 

4.2 ORGANIZATION 

The Phase I MIDAS System can be visualized most readily if organized into subsystems 
as shown in the block diagram of Fig. 18. The system is under complete control of the 
Digital Equipment Corporation (DEC) PDP-11/45 computer system. All ccntrol inputs are 
made by an operator via the computer keyboard. All commands are translated by computer 
software into code % ords sent out over interface devices to set up the hardware registers in 
the special purpose processor. These codes are decoded in three of the blocks shown in 
Fig. 18: (l) in the Control section, (2) in the Clock section, and (3) in the Diagnostic /Output 
section. The codes will be described in detail in subsequent sections. 

High-speed mass data transfer to and from the compute; takes place from (1) the A/D- 
D/A (hybrid) section, (2) the Clock section, and (3) the Diagnostic /Output section. High-speed 


\\(X -M) T 0" 1 (X 


M) + fn |0 I + n fn 




(i) 
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TABLE 3. ADVANTAGES OF DIGITAL PARALLEL CLASSIFIER 

Uses current state-of-the-art digital techniques 
Less costly than current hybrid techniques 
Complete repeatability in setup and performance 

Computer -controlled diagnostics easily implemented for error-free operation 
Throughput equal to current analog hybrid techniques 


TABLE 4. CHARACTERISTICS OF PROTOTYPE PROCESSOR 

All-digital parallel classifier under computer control 
Classification rite of 200,000 data vectors/second 

Classification of data vector into one of eight stored categories for eight -channel data 
Classification of data vector into one of sixteen stored categories for four -channel data 
Potential of expansion to sixteen-channel capability 
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FIGURE 18. BLOCK DIAGRAM OF THE PHASE I MIDAS SYSTEM 
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multichannel data transfer takes place through the hybrid section and the quadratic computa- 
tion Hardware. 

4.2.1 QUADRATIC PIPE COMPUTATION 

The quadratic pipe computers perform the quadratic calculation given by Eq. (11): 

O = (X - M) T S _1 (X - M) 

This calculation can be expressed in a number of ways to optimize the computation. Since 
*he number of bits in the special-purpose classifier is limited, it is desirable to express the 
quadratic calculation such that the calculated result has a very limited range. The variance - 
covariance matrix 9 can be expressed as 

M-MMM (i2) 

where [a] is a diagonal matrix of the standard deviation, and [p] is the correlation matrix 
with all l's on ^he diagonal and values of 0 to 1 off the diagonal (in some cases negative 
values may occur). Taking the inverse of (12) yields 

i»i' 1 -[;]rpr 1 [j] <«> 

Substitution of Eq. (13) into (11) results in 

" 4 ) 

The terms (X - M)/cr can have n very wide range. However, if the r..age 

-8 < (X. - M.)/o. < 8 (15) 

l l i 

is exceeded, the value of X for that channel is too far from the mean to oe considered for 
classification. Truncation of significant bus will occur and a flag is set indicating this con- 
dition. This indication is used to reject a decision that the sample is from the particular 
class. 

The computation of Eq. (14) could proceed in a straightforward manner, but can be sim- 
plified somewhat due to the symmetry of the correlation matrix and its inverse. This simpli- 
fication can be accomplished in more th m one way. One method is as follows: 

Q = [Y] T [Y] = [^M] T fBl T [B][^] (16) 

where B is an upper triangular matrix formed by the decomposition of the inverse p matrix. 
By calculating 
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Terih 


[Y] = f B] ] (17) 

the final matrix operation is simply 

Q,rYl T fY] (18) 

U 1 

where the y. are the elements of the [Y] vector. 

There are four steps implied by Eqs. (16-18). These are: 

(1) Subtract the mean from each channel. 

(2) Multiply each result by 1/cr. 

(3) Perform the Y matrix multiplication on each result of Step to get Y’s. 

(4) Square each resulting Y and add the results together. 

Another method for calculating I (14) is to express the inverse of the correlation ma- 
trix p"* in terms of its eigenvalues and eigenvectors. From Lanczos [3] we can express the 
correlation matrix as: 


p = UAU T (19) 

T 

where the U matrix is comprised of eigenvectors arranged in columns, and U is its trans- 
pose. The A matrix is the set of eigenvalues on the diagonal. Taking the inverse of the cor- 
relation matrix, it can be shown that 

p' 1 = UA^U 1 (20) 


which is to simply take the reciprocals of the eigenvalues and multiply by the two original 
eigenvector matrices. One further decomposition brings us to the desired form 

p’ 1 = [U(A*V 1/2 |-[(A'V 1/2 U T | (21) 

Substitution of Eq. (21) into Eq. (14) and defining (A *) = A yields 


Q - 



-l/2u T 



( 22 ) 


In this case, if a vector Y is defined as 
y , A’ 1/2 U T [~^] 


(23) 


and computed as such, then the final matrix operation can be performed in the same manner 
as in Eq. (18). The hardware required for this second calculation must perform more multi- 
plication *han in the first method. However, in order to implement the first method, a more 
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elaborate switching scheme is needed to avoid multiplying by a large number of zeros. The 
details of this switching scheme were not worked out and only general consideration was 
given to it. Since it appeared desirable to have the flexibility and capability to do the matrix 
multiplication required by either Eq. (17) or (23), the hardware was designed to do full matrix 
multiplication. Tests were run on over 100 signatures to see if the resultant set of coeffi- 
cients for either method appeared better suited to a limited -word -length multiplier. From 
these results it appeared that a slight advantage might be gained by using the second method. 

A block diagram of the quadratic pipe is given in Figure 19. The actual computations 
are performed by a set of time -shared arithmetic units arranged in a sequence allowing a 
set of 16 operations to be executed in less than 5 fisec before the outputs are latched. Each 
stage supplies its results to a subsequent stage for further processing. Precision varies 
from 8 to 14 bits as the data progresses through the quadratic pipe and acquires greater 
significance. 

Internal arithmetic operations are currently performed at less than 50% of manufacturer’s 

rated component speeds. For example, an 8 x £ bit multiplication is rated at one product 

-9 -9 

every 135 x 10 sec, but is currently being employed to produce a product every 270 x 10 

_9 

sec. This rate may be increased to one product in less than 200 x 10 sec while maintaining 
adequate derating if the system operates at higher speeds in later design stages. 

4.2.2 SCALING 

The calculation of the quadratic form given in Eq. (11) above can result in a very large 
number. Fortunately, we are not interested in the exact result for large values and, there- 
fore, can restrict the range, provided that an overflow condition is detected. The scaling of 
the quadratic pipe is summarized in Figure 20. All arithmetic operations up to the input of 
the squaring circuit are 2’s complement arithmetic. The input data is an 8-bit word having 
the range of values 

-128 < x < 127 

We are interested in small values of X - /i, but an overflow condition can be detected which 
allows us to keep the resultant to eight bits. Following addition, a multiplication by 1/a is 
performed. A restriction on the range 

-8 < Z-Ui < 8 
a 

is placed on the output of this multiplication. This in effect limits any data channel value to 
less than eight standard deviations from the mean. It is to be noted here that a lower limit 
on a is required and consequently an upper limit on 1/a also. The limit selected is 
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FIGURE 20. BLOCK DIAGRAM OF CLASSIFIER SCALING 
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cr - 1 +(1, 2048) 


or 

1 a < 1 - (1 2048) 

which appears to be a reasonable choice based upon examining a number of signatures. Also, 
it would seem reasonable that variation in the data for a given object class would show changes 
in the two least significant bits. However, for ERTS data there is some recent evidence that 
this is not always the case. 

The above two restrictions determine which high -order bits can be tested for overflow 

3 7 

and discarded at the output of the 1/a multiplication. The five bits from 2 to 2 are tested 
for an overflow condition. If an overflow is detected, an overflow bit is put into and clocked 
down a shift register, which is oarallel to the pipe, in step with that data value, so that at the 
final test for recognition that calculation is discarded. 

The input to the eight matrix multiplier cards is comprised of the sign bit and the seven 
2 -4 

bits from 2 to 2 from the variance card. It is to be noted that this quantity (X - [i)/o has 

zero mean and variance of one when the input data ( v ) originates from the same distribution 

as that us d to calculate (i and a . Similarly, after the matrix multiplication is performed 

ia the eight matrix multipliers, the y outputs have zero mean and a variance l/k where the 

-1/2 T 

constant k is the largest value in the A U matrix. This normalization of the matrix 
is done in order to get maximum utilization of the 8 -bit RAMs and their associated multi- 
plier input. Empirical study of more than 100 signatures indicates that the value of k is 

from a little over 1 to about 5. The output of the matrix multiplier is tested for overflow on 
3 4 5 

bits 2,2, and 2 of the 12 -bit adder used to accumulate the matrix multiplication result, 
but these bits are not used as input to the squaring circuits. 

The constant k which was factored out in the matrix multiplication is reintroduced in a 
final multiplication stage. Since there is a different k with each object class, the set of k’s 
can be normalized with largest k factored out. This again makes maximum use of the limited - 
bit multiplier and RAM. 

4.3 COMPUTER SUBSYSTEM 

The general-purpose machine is a disk -based Digital Equipment Corporation (DEC) 
PDP-11/45 configured with 24K of core, three tapes, serial printer and keyboard -CRT. The 
system software is built around the disk operating system to provide fast access to the oper- 
ating programs and available languages. 

The DEC PDP-11/45 computer system is a fast, medium -size minicomputer, whose 
architecture allows extremely flexible interfacing with non-standard I/O devices. This ease 
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of interfacing and speed were the primary reasons for use of the PDP-11/45 computer as a 
controller for the input and classifier hardware used in multispectral data recognition 
processing. 

The hardware peripherals available in the ERIM configuration are, in detail: 

(1) The PDP-11/45 central processing unit (CPU), including the floating-point processor. 

(2) An RK-11C disk controller and an RK-05 disk with interchangeable cartridges, each 
containing up to 1.2 million 16-bit words. Average total access time is 90 millisec- 
onds, Data transfer rate is 1M microseconds per 16-bit word, 

(3) 24K of 16 -bit core memory with a cycle time of 850 nsec, access at 350 nsec (450 
nsec at the UNIBUS). 

(4) An LA-30 DECwriter data terminal, with a character set of 64 symbols at speeds up 
to 30 cps. Output is generated by a 5 x 7 matrix. 

(5) A VT-05 alphanumeric display terminal with a CRT display and communications 
hardware capable of data transmission at rates up to 300 baud in full or half duplex 
modes. 

(6) A TM-11 magnetic tape drive controller, two 9 -track TU-10 magnetic tape drives, 
and one 7 -track TU-10 magnetic tape drive (read/write speed of 45 ips). Densities 
available are 200, 556, and 800 bpi for the 7 -track and 800 bpi only for the 9 -track 
drive. 

(7) Two DR-11B direct memory access devices for transmission of data between an ex- 
ternal device and memory via the UNIBUS, without a need for continuous control by 
by CPU. 

(8) One DR-1 1C device interface for transfer of data between a user device and memory 
via the UNIBUS. 

(9) One KW-11P programmable real-time clock, providing programmed real-time in- 
terrupts and interval counting in several modes of operation. 

Figure 21 shows the configuration in use at ERIM. 

Sof f ware available on the PDP-11/45 system comprises the Disk Operating System (DOS), 
Version 8.8, supplied by DEC. Under this operating system, MACRO-11 (a Macro assembler), 
FORTRAN IV, EDIT -11 (a file editing package), ODT-11R (an on-line debugging pack i ^e», 

PIP (a file modification and transfer package), and several other utility packages are 
provided. 

All output is either through the LA -30 DECwriter, on magnetic tape, or printed with a 
line printer. Input for large quantities of data is via magnetic tape or through the keyboard 
devices. 82 
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FIGURE 21. PD P-1 1/45 CONFIGURATION 
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SUMMARY OF PHASE II SYSTEM DESIGN 

The MIDAS System is to be developed further during the second phase of the two-phase, 2-1, 2 - 
year program by accomplishing several principal objectives; (1) the addition of a pre-processor: 
(2) adding more peripherals, such as a color, moving-window display, a high density tape inter- 
face and, if possible, a hard-copy color output; (3) developing conirol and operating software to 
provide a simple interface to the user; and (4) study of several possible uses for the system in 
order to evaluate its overall potential. 

This portion of the • eport describes the projected system, complete with both hardware 
and software, as planned for the end of the second phase. 

5.1 SYSTEM DESCRIPTION 

The MIDAS System, at the end of the first phase of the program, is a minimal operating 
system for demonstrating the feasibility of on-line, near-real -time digital multispectral analysis 
and recognition using a parallel, low-cost processor combined with a minicomputer. These ob- 
jectives have been met. 

The second phase is intended to further develop the system by adding hardware devices and 
a software operating system to obtain a powerful, flexible, and easily controlled system for ana- 
lyzing and processing multispectral data at high data rates from a variety of sources. The data 
rate may be as high as 200,000 data vectors per second if the data source permits. Since a 
large number of functions (or operations) are to be performed by the system, these functions 
may perhaps be best summarized by a narrative presentation describing individual functions as 
a user would normally employ them in the process of operating the system with real data. There 
are, of course, many different sequences a user may wish to follow in processing multispectral 
data through the system. These cannot all be described in detail, but the more important or 
more likely sequences will be discussed or mentioned in passing as the sequence is elaborated. 

5.1.1 ATYPICAL PROCESSING SEQUENCE 

The user normally enters the processing sequence with a set of data (some ERTS tapes for 
example), has some imagery and collateral information about the scene, and wants to obtain addi- 
tional imagery showing the location, identification, and possibly the spatial extent of certain nat- 
ural or cultural features of interest to him. He will also be interested in the reliability of the 
classification process, the feasibility of separating and analyzing various features in the scene, 
and, perhaps, in obtaining some summary statistics about the spatial or spectral characteristics 
of particular features. 
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His problems may range from finding answers for research questions to accomplishing 
some quasi-production task. Thus he may be interested in the feasibility of deterr^ning the 
volume of water in the Everglades, on the one hand, to obtaining the projected yie d of corn in 
the state of Iowa, on the other. The MIDAS System is flexibly designed to do useful processing 
in a multiplicity of such applications. 

5. 1.1.1 Setup 

Two things are done in this initial mode. First, the user must perform a routine signal con- 
ditioning of the data. Since the classifier input is only eight bits, the accuracy of classification 
may be affected if the dynamic range of the signals is too small. Thus the signal conditioning 
will consist of a search through a representative set of the data to pick off, with hardware, the 
minimum and maximum value in each channel (assuming that the data is in straight binary for- 
mat). The minimum value is stored (negatively) as one input to an adder on each channel. The 
maximum value is used to compute an integer value 1 through 7 for input to a multiplier. These 
operations on the data can be performed with simple hardware and will condition the data so that 
the 0 to 255 dynamic range available is well used. Although such operations may be handled in 
conjunction with angle corrections, it appears desirable to keep them separate (to avoid a book- 
keeping problem) as explained below. It is planned for Phase II that all data-sets stored in the 
computer will be raw data . All signal conditioning constants and preprocessing constants, either 
computed previously or to be computed, will be stored with the data. These can then be selects 
and used to transform the raw-data set (a training set, for example) into data input to the signa- 
ture analysis. 

As a second operation in the setup mode, the user must locate features and areas of the 
scene about which he has information; this allows their designation as training or test areas for 
the system. He may do this off-line with printouts which enable him to designate the spatial 
address of his data and then supply this address to the system. Or, beginning with no print- 
out, he may call for a color display of the complete area, zoom down to a full resolution sub- 
area of the original frame, and then designate to the sy 'em the training and test are^s via an 
interactive display and appropriate cursor placement. To perform this search and locate such 
areas in minimum time requires quick search capability for a large amount of data (about 30 
million bytes). The quickest way would be to have the data stored on a large disk; the next 
quickest is to have the data stored on high-density digital tape (HDT). However, if the data are 
stored on computer -compatible tape, a high-speed tape drive (125 ips) should be available to read 
the data onto the disk. 

But what if the unenhanced display does not provide enough contrast to allow the ooerator to 
locate these areas? In this case, the operator may choose functions of the input data, combining 
data and colors to enhance certain differences among the features of interest. Upon selection, 
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this data may be supplied to memory as training sets. Optionally, the operator may make a color 
copy of this enhanced imagery as a record for reference or off-line study. 

If its quantity is large, the data which an operator desires to use for further analysis or 
classification would, at this point, normally be placed on high density digital tape (HOT). (The 
use of HDT as input in all digital data cases may be desirable to lighten the raw data nandling 
load on the computer by placing this load in the MIDAS front end.) 

5. 1.1. 2 Analysis Mode 

When the user has isolated data from areas of the scene as described above, he must then be- 
gin an analysis to define the methods and parameters necessary in pre-processing to obtain the 
parameters needed for classification. If we assume, just to illustrate the options, that he requires 
all the analysis capability of the MIDAS, the complete setup can be described as follows: (1) 
calculation of scan-angle correction functions, additive and multiplicative: (2a) selecting and 
calculating channel ratio transforms, or (2b) calculating a linear dimension reduction transform: 
and (3) calculation of classifier parameters. These options or steps are performed in the se- 
quence given above since each step provides both improved data for the next step and successively 
enhanced data for classification. Not all of these steps need be performed. Note also that it is 
not possible to take linear combinations of ratios. 

The calculation c f scan-angle correction functions (step 1, above) is normally required in 
processing aircraft data and, Tor spacecraft data, may be needed to remove path radiance. 
(Processing of S- 192 data from Skylab indicates 10-20°; variations in data amplitude as a func- 
tion of scan angle for some channels.) The calculation may be done in either of two ways. In 
one, the complete data-set is analyzed in strips of scan-angle increments to obtain averaged 
variations of data as a function of scan angle for multiplicative corrections and to look for 
darkest objects at various scan angles in order to correct for path radiance. The other way 
is to choose the training set data in such a manner that it is distributed over the scan angle 
range of the scene. From these sets of data, a set of correction functions may be calculated. 

The training set data is then transformed with these functions before the next st jp in pro- 
cessing — i.e., that of calculating ratio transforms. 

Ratio transforms (see step 2a above) are used in pre-processing when one wants the data to 
be relatively insensitive to illumination variations, where transference of signatures from one 
frame to another is desired, or where spectral features can be enhanced. There is no clear-cut 
means of examining a data-set to decide, in an a priori manner, which of several possible trans- 
forms is needed for a particular scene. The method normally used would be to perform each of 
several transforms on the training sets and then test the resulting data for its optimal probability 
of correct classification; here the training set and test set data enable selection of the transform 
to be employed. When this selection has been made, the properly transfer <j data is used for 
the next step calculation of a dimension-reducing linear transform. 
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The purpose of the linear transform (step 2b, above) is to provide a rearranged set of data 
in which the spectral information is combined in such a manner that the transformed data has 
a greater discriminability per dimension for the classes of interest to the user. The classifier 
can then perform a classification operation using the smaller number of dimensions with an 
accuracy of classification equivalent to that obtainable with a larger number of untransformed 
dimensions. The classifier may then make more classifications on the data-set than it does on 
the untransformed data. The operator can, in effect, use a single transform on every data vector 
to obtain a set of transforms of smaller size within the classifier, thus increasing the classifier’s 
net capacity. (Because there are several approaches to calculating such a transform, a specif- 
ic method has not as yet been selected; this uncertainty, however, in no way affects the design 
of the hardware.) 

Finally, after these transformations have been effected on the training data, the classifier 
parameters are calculated (step 3) and supplied to the classifier. This completes the MIDAS 
setup for classification. 

5. 1.1.3 Verify Mode 

Often the user will wish to defer an immediate start on the classification of a large area of 
data, preferring instead to evaluate the classification operation on the basis of a set of smaller 
test set areas. These are stored in the comp iter early in the setup operation. 

The data from these smaller test set ureas are called up, supplied to MIDAb, and then, 
after classification, input to the color display. Here they are displayed in one of two modes: 
single-field or checkerboard. The single-field mode .hows the classified results for one field 
at a time as a true ima^e of the field in which anomalies can be detected based on their shape 
and location in the field. The checkerboard mode presents several fields on the display at one 
time: these are located in a matrix of sub-areas arranged to make optimum use of the display 
area. 

The display that performs this task should have its own storage for refreshing the CRT dis- 
play. The alternative of loading classified results at 200,000 decisions per second into the com- 
puter over the Unibus, while at the same time refreshing the display, does not appear possible. 
Much lower data transfer rates can be achieved by having the memory in the display, along with 
a scrolling capability. In this way display updating can be achieved by simply reading in the new 
scan line, thus keeping display-data transfer rates low and greatly helping to maintain the fast 
classification-result transfer rate. 

Tlu operator-user may examine the results to verify that the machine is correctly set up and 
that t h <i discrimination among classes is adequate. Possibly he may call for a color print of 
these displays to examine the results off-line before releasing the system. 
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5. 1.1. 4 Class i fy Mode 

A user normally enters this mode by supplying his bulk data into MIDAS on an HDT, al- 
though he may optionally supply data from analog tape, computer-compatible tape (CCT) or 
from disk core. As data enters the front-end. proper channels are selected and gated into the 
preprocessor. The data is then corrected by the preprocessor functions and transformed as 
specified. The data stream out of the preprocessor is immediately input to the classifier where 
it is classified and displayed as rapidly as it enters the system. The user may pause at any 
time and examine a portion of the area on the color display to obtain an intermediate evaluation 
of the classification process. The output codes may be logged into the computer in three ways: 
element-by-element, summary by class for each line, or totalized results for each class for the 
entire run. These codes may be examined at any time either via printout or on t lie display. 

The color display may be dumped to a color printer at any time during classify mode opera- 
tion — though normally, the user would examine the full area once before obtaining a color print. 
Printing takes about 50 seconds per frame. 

5. 1.1.5 Diagnostic Mode 

The preprocessor and processor contain a bus system which allows the computer to access 
the data being processed at principal points in the cascade of arithmetic operations. This feature 
which has been included to facilitate fault isolation, particularly in bringing the system up after 
manufacture — allows examination of intermediate results which are very often of interest to a 
user operator during the processing sequence. 

These intermediate results are available as an image displayed or printed out in color and 
can be selected at any time under computer control. Thus, the user is afforded considerable 
flexibility in taking advantage of any subsets of arithmetic operations done in the system. 

5. 1.1. 6 Operator L og 

All of the steps taken by an operator user in this sequence of operations are maintained in a 
system log which becomes his record of the processing operation. Items to be included are all 
analytic results, training and test area data, a record of operator choices, the sequence of steps 
taken, and any pertinent ancillary data. Tins record is to be kept on the operator’s tape by the 
user operator. 

5.2 SYSTEM HARDWARE 

The overall system hardware is diagrammed in Fig. 22. The MIDAS System consists of 
several principal subsystems: the general purpose computer (DEC PDP-11- 45) (see Fig. 21), 
the classifier (see Fig. 19), the control subsystem, a hybrid subsystem, and the preprocessor. 
Associated peripheral equipments are the color display, HDT tape recorder, analog tape recorder 
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and a color printer. Of these, the PDP-11 45, the classifier, the control subsystem, and the 
hybrid subsystem have been designed, fabricated, and tested during Phase I of the program. 

The remaining units to be fabricated, procured and interfaced during Phase II are indicated in 
Fig. 22 as cross-hatched whole or partial blocks. Among these are the preprocessor, color 
moving-window display, HDT and its interface, the color printer, and some computer peripherals 
such as a disk and a line printer. A more detailed description of these subsystems and peri- 
pherals is given below. 

5.2.1 CLASSIFIER 

The clr * sifier will be expanded from the present eight -channel, eight-class configuration 
to an eight-channel, sixteen-class configuration. The hardware implementation of this expan- 
sion will be essentially identical to the present classifier except for possibly the final output 
circuitry. The expanded classifier will operate at the present rate of 200,000 decisions per sec- 
ond. One change from the present design may have to be made in the computation of the sub- 
traction of the mean. Currently this operation is an 8 -bit integer subtract, but may have to be 
modified to a floating point operation to allow dynamic range expansion in the divide process of 
the preprocessor described below. 

5.2.2 PREPROCESSOR 

The preprocessor first performs the several functions described in Section 5. 1.1.2: cor- 
rection of data as a function of angle, calculation of ratios of spectral bands, and calculation of a 
linear transform of the corrected data. A block diagram of the subsystem is shown in Fig. 23. 

Data are passed to the angle correction port as a data vector, X. For each angular increment, 
this vector is multiplied, element by element, by a function of the scan angle. This function may 
be supplied in one of two ways: (1) by computing each function of each angle using a fifth -order 
polynomial calculation in which the coefficients alone are stored and the functions of 0 f or a given 
angular increment are calculated; or (2) by pre-computing all functions of angle and supplying 
these into a local memory, thus requiring only access to these stored constants followed by 
multiplication. The sequence of multiplications will occur at a rate of 300 nsec per multiply 
and, for 16 input signals, require 5 psee to complete. 

The corrected data then enter the ratio calculator which consists of a fast divider and a set 
of 16 adder-subtracters. These arithmetic elements operate in a sequence controlled by micro- 
program vectors stored in a semiconductor random access memory (RAM). Any generalized 
sum and difference functions of sixteen channels can be performed. In the block diagram of 
Fig. 23 the a^ and b i take on the values zero, one, or minus one. Eight of these calculations are 
placed on a bus to form the numerator input to the divider; the remaining eight are bussed to the 
denominator input from which eight ratios will be formed. A division technique that appears 
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promising for this function will consist of a table-lookup and subtraction process done in a 
pipeline manner. An unnormalized, floating-point numerator and denominator are entered 
into the divider and updated every 312 nsec. The quotient of the first pair appears 5 /isec (16 
clocks) later as a 12-bit floating-point number. 

The linear transformation will consist of eight of the multiplier-summer cards operating 
on each of the 16 channels to provide eight transformed vector elements. A certain amount 
of further study is needed in this connection to determine the effectiveness of the transformed 
vector as a classifier input vector. Preliminary studies [5] indicate that for some representa- 
tive recognition problems, no more than 4 transformed channels may suffice to describe data 
from 10 spectral bands. In this case the preprocessor could be restricted to a 4-dimension 
output and the classifier used in the 4-channel, 16-class mode. The preprocessor as now planned 
will supply an 8-dimension output and accept a 16 -spectral-band input. 

The preprocessor, then, can accept 16 or fewer spectral bands from sources such as \ky- 
lab (S- 192), ERTS, Bendix M 2 S, NASA-Houston-MSDS scanner, or the ERIM M-7 scanner. The 
24-channel Houston-MSDS scanner will require two passes to enter training sets, followed by 
selection of 16 out of 24 channels for MIDAS processing. In each case the output will be an 8- 
dimension vector to the classifier. 

It is to be noted that the preprocessor will not perform linear combinations of ratioed data. 
Since the ratio is a floating-point number, hardware will have to be designed and developed for 
this more complex arithmetic. However, by working with 8-bit integers coming from the angle 
correction, the classifier circuitry developed in Phase I can be used directly. This limitation 
is not too severe when one considers that ratioing of sums and differences is a dimensionality- 
reducing operation, as is the case with linear combinations. Furthermore, since the outputs of 
these two preprocessing operations are bussed together, a mix of ratioing and linear combina- 
tions can be obtained, thus increasing preprocessor flexibility. 

5.2.3 COLOR MOVING WINDOW DISPLAY 

This is a three-color video display using a shadow-mask CRT and MOS storage for display 
refresh. The present choice is the RAMTEK GX-100/200 series configured to hold a 5-bit color 
vector with 480 x 640 elements on the display. The unit will require modification to allow direct 
data input from the MIDAS as well as normal input from the PDP-11/ 45. 

The unit offers image display, alphanumerics, vector generation, moving window display, 
and table look-up of predetermined colors. It provides the major means for a fast man-machine 
interface. A scene or portions of a scene can be displayed almost immediately upon user re- 
quest, provided the computer can access the data quickly. Once the MOS memory is loaded, the 
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computer can be performing other operations. Using the display memory as a source, the unit 
appears capable of driving external hard-copy devices while the user is viewing the CRT. 

5.2.4 HIGH-DENSITY DIGITAL INPUT 

In the present design of the MIDAS System, digital data are presented to the classifier by 
reading data from standard computer compatible tapes (CCTs) into the PDP-11/45 computer 
and from there to the classifier through a DR-11B interface. The maximum rate at which data 
can be processed in this mode depends on the throughput of the input tape units. The present 
MIDAS configuration uses tape units that operate at 45 ips and offer a maximum byte density of 
800 per inch. With these units an average byte rate of approximately 35,000 per sec can be pre- 
sented to the classifier. 

In order to increase the rate at which digital data can be processed, a data source offering 
higher throughput than the present tape units would be required. If a currently available high- 
density digital tape system (HDT) were designed into the MIDAS system, data rates in the order 
of 150,000 data vectors per second could be realized. The HDT system uses a wide-band 
(with a 1.5 MHz cutoff) multichannel tape recorder to encode and record the digital data in 
serial form (NRZL). A special interface unit would be required to read the data from this type 
of recording system into the MIDAS classifier, figure 24 shows a block diagram of the HDT 
interface unit. 

Multichannel digital information is recorded on parallel tracks of the HDT system, one 
track for each channel. The data from one track comes off the HDT system in serial form 
(NRZL). This serial bit stream is converted into parallel 8-bit words in the serial-to-parallel 
converter. These words are then stored in the deskew memory until the data from all the tracks 
are in their respective memory units. The multichannel information for a resolution element 
is then stored in output three-state registers where it is multiplexed into the MIDAS System. 

5.2.5 COLOR PRINTER 

An extremely important device for the MIDAS System is a fast color printer. A number of 
methods have been proposed for this function including CRT-color filter camera devices, multi- 
laser film devices, LED color array systems for printing on film, and colored-ink jet printers. 

Of these, the most attractive method is the ink-jet printer since it immediately produces a usable 
output, with no delay, for color film processing. This printer has been developed by several 

7 

sources at present and appears capable of producing a picture containing about 10 elements in 15 
minutes or less. An alternative possibility is a multi-LED scanning film printer using a Polar- 
oid or Eastman fast color development method. Its feasibility depends, however, on adequate 
brightness in experimental LEDs as well as on the resolution and color fidelity obtainable. 
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Some effort will be devoted in Phase II to study and evaluation of suitable color printing 
techniques. 

5.2.6 COMPUTER PERIPHERALS 

Other computer peripherals to be added to the PDP-11/ 45 system during this period in- 

g 

elude: (1) a mass storage (58 x 10 byte) disk, (2) additional core memory (32K words). (3) 
memory-management hardware, (4) a fast line printer (300 to 600 lines per minute), and (5) fast 
tape units (150 ips, 9 track, 800 bpi). Principal reasons for these additions are; (1) to facilitate 
Phase-II software development wherein a real-time-sharing executive-system for all operating 
modes will be developed that will normally require the simultaneous involvement of three pro- 
grammers; (2) to speed up data handling for framed data such as ERTS; and (3) to improve the 
operating speed of programs using large quantitites of data by increasing core residency. 

These reasons are discussed below as they relate to each peripheral. 

5. 2. 6.1 Line Printer (Printer/Plotter) 

The acquisition of a medium- to high-speed line printer is necessary if the development of 
the complete set of operating software proposed for Phase II is to be completed. In Phase I, 
difficulties were encountered in obtaining timely hardcopy listings when a two-man software 
effort was being made. With the full-time effort of three programmers and with the increased 
amount, scope, and complexity of the Phase II software, the effectiveness of the programming 
staff would be severely curtailed if it were necessary to rely on the non-dedicated use of a low- 
speed line printer (approximately 120 cps) or the console keyboard/ printer capability (approxi- 
mately 30 cps). Turnaround times currently approximate several hours during prime working 
hours for moderate-size software listings (^20 to 30 pages), whereas this could be reduced by 
a factor of 10 with the acquisition of a line-printer capability dedicated to the PDP-11/ 45 
MIDAS System. 

In addition, an electrostatic printer/plotter seems an increasingly desirable acquisition. 
While this type of printer/plotter system would fulfill the printing requirements admirably 
(approximately 600 lines per minute), the inclusion of a hardcopy plotting capability at a small 
additional charge id extremely attractive in view of recent price decreases by the leading manu- 
facturers of this type of system. This plotting capability would be extremely helpful in providing 
hardcopy of graphs, histograms, cluster-plots, etc., generated in both pre- and post-classifica- 
tion analysis of the data. 

5. 2. 6. 2 Large-Capacity Disk Drive 

In the Phase I implementation of MIDAS, the secondary storage capacity via random-access 

g 

disk was limited to 1.2 x 10 words of data. This number severely limits the amount of data 
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that can be handled in a single pass of the input data source. Since one of the primary ob- 
jectives of Phase II is the interactive communication between user and the MIDAS Classifier in 
a relatively short time frame, it is necessary to be able to access large amounts of data in an 
extremely brief period of time — i.e., in less than ten seconds, rather than in a period of min- 
utes. If the data to be accessed resides on the disk (with average access times of approxi- 
mately 30 msec), input data can be obtained in a time frame of less than 1 sec, and subsequent 
data transfers (once the read heads are positioned) can take place at rates on the order of 
3.12 x 10 bytes/ sec. 

In addition, if we were to attempt the post -classification analysis necessary to establish 
accuracy, acreage totals, etc., it would be necessary to store the classification results on a 
secondary (i.e., noncore memory) storage medium. No peripheral device other than a disk of 
this capacity fulfills the following necessary requirements: (1) the ability to store data at a rate 
comparable to the speed of the input data source as well as to that of the MED AS Classifier hard- 

g 

were, and (2) the capacity for storage of data on the ordei of several megabytes (10 bytes). 

The disk configuration currently being investigated consists of a dual-spindle drive, and 
thus contains two 29-megabyte disk cartridges with separate moving heads. Thus, if one drive 
were dedicated to input and one to output, no alternating head repositioning would be necessary; 
and the system could be driven at close to the maximum rate. 

Another proposed feature of the MIDAS System for Phase II is the capability to expand or 
decrease the scale of the display generated on the color CRT. To do this in a mode which allows 
quick response to user commands, random access to stored data is likewise necessary in a 
time frame of seconds in order to maintain an interactive system — and this is possible only with 
a large-capacity, high-speed, random -access disk system. 

If a large capacity disk drive is not included for Phase II, interactive uses of the system will 
be severely crippled. With only a low capacity disk, all the programs and data sets necessary 
for interactive use can not be kept on disk. Instead, magnetic tape will have to be used as storage 
for some high -use programs and data-sets. Then in order to run a program, the tape will have 
to be searched and the program loaded, all at low data-transfer rates. If a series of programs is 
to be executed repetitively or a data-set used repetitively, the MIDAS user will have to wait a few 
minutes for each program to be found, a few minutes for each program to be loaded and started, 
and a few minutes for each data-set to be accessed. This defeats the purpose of the Phase II 
MIDAS — namely, the ability of the hardware to interact with the user in a short time frame, 
hopefully less than a few seconds. 

5. 2. 6. 3 Expansion of Memory and Addition of KT-11C Memory Segmentation Unit 

During the Phase I implementation of MEDAS, programming difficulties arose trough lack 
of adequate amounts of core memory storage. In particular, limitations were necessary on the 
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number of pixels which could be used in signature extraction via the SIG and STAT programs. 

In Phase II, in order to increase system throughput of data, multi-level buffering of I/O will be 
necessary in some instances, depending on the peripherals in use (e.g., magnetic tape CCT in- 
put and disk output will require buffering both on input and output to efficiently use the capa- 
bilities of the MIDAS Classifier hardware). 

The amount of additional core memory necessary for Phase II cannot be absolutely deter- 
mined; however, 32K words seems the logical choice for the following reasons; 

(1) The allocation of 32K into 8K for additional program storage, 16K for input buffers, and 
8K for output buffers is appropriate in terms of quantity of data input vs. quantity of data out- 
put. For ERTS data, in particular, four channels (and thus four bytes) of input data will usually 
generate two bytes of output data (recognition result and exponent), hence the two-to-one ratio in 
the allocation of I/O buffers. 

(2) 16K boards are currently available; from past experience these are more reliable (by 
virtue of fewer ICs, connectors, etc.) than 8K modules and also more cost-effective in terms 
of bits per dollar. (Note; It is not possible to mix 8K modules and 16K modules because of 
chassis requirements.) 

5.3 SOFTWARE 

The software effort for J hase II will address two principal objectives: (1) the generation of 
a well -designed operating system for the user/operator to control and use MIDAS; and (2) gen- 
eration of the new programs to perform calculation and setup of the preprocessor, to operate 
the display and print subsystems, and to control format and sampling of data input. 

The modes of operation described in Section 5.1 assume the presence of the Phase II hardware 
and software and illustrate the manner in which the Phase II software will be used. The Phase-I 
software ma> bethought of as a skeletal version of the software needed for the system; all of it 
is, indeed usable in Phase II. There has, however, not been enough time or funds available in 
the Phase I effort to attempt to manage the elements of the software system with an efficient 
operating system or, moreover, to project the kind of software operating system a user may need 
when the MIDAS system becomes more operational. 

An important task of Phase II, then, is to project the framework of an operational system 
within which to develop and assemble the Phase II software system. Several environments may 
be considered likely for a MIDAS System as processing technology moves more toward opera- 
tional use. A time-shared, multi -terminal version of MIDAS may prove, for example, to be ?n 
efficient means of providing more output for more users by taking advantage of the speed of 
MIDAS versus the slower interactions of the human user. In this version, it is important to 
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define the software modes and modules carefully and assess the execution time and operator 
time for each. From this, we can determine how many users with what kinds of terminals 
could effectively use MIDAS. 

On the other hand, it may be desirable to provide remote time-sharing of the system by 
having various modular remote terminals. Estimates of communication loads and cost savings 
over conventional computer usage are neede* to verify the utility of such an operation. Such a 
mode may well offer considerable economy. 

Because of the above considerations, the Phase-II software modules and operating system 
will first be examined in the light of subsequent use as parts of more advanced systems and then 
developed, insofar as possible, so that they may be employed in such systems with little or no 
modification. This constitutes the initial design step in the development of the software. 

Next, given these constraints and also an overview of MIDAS operational modes as dis- 
cussed in Section 5.1, a MIDAS executive system will be designed to allow an operator/user 
to employ the MIDAS System for simple tasks or with considerable flexibility, as needed. This 
executive system may be overlaid on available multiprocess systems or could be designed to 
replace such a system. 

Briefly then, the software design will include the principal tasks shown in Tables 5 through 
10 and discussed in the following task description: 

(1) Investigate an operating system to be used to control loading, linking, assembling, etc., 
for MIDAS software system: 

(a) feasibility of using the newly released (August 1973) DOS/ BATCH operating system. 
This is the standard Digital Equipment Corporation software system supported by 
DEC. 

(b) possibility of using a multi-user, multi-tasking system with MIDAS System. If this 
approach is implemented, possible candidates are RSX-11D or RSTS-11E, both be- 
ing software products of DEC. 

(c) feasibility of writing our own operating system incorporating system programs 
available from DEC, (e.g., the MACRO- 11 assembler, the Editor, etc.) tailored to 
our own specific needs with the MIDAS System, e.g., possible multi-user graphic 
CRT consoles, or several single-user -dedicated disk units. 

(2) Determine the software functions to be implemented in Phase II of the program. These 
will include programming in the following areas: 

(a) data format and/or storage media conversion 
Examples: 
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Function 


Data Input 


Tape Control 


Display 


TABLE 6. MODE I - SETUP 


Requirements, Controls 


Software 


No. of Channels 
Ident. of Channels 
Nu. of Samples 
Location of Samples 
16 Locations — 8SS, 8RS 
(SS = Scan Sync 
RS = Roll Sync) 

Sampling Rates 
2C, C, C 2, C/3 . . . 

(C - clock) 

Line Count - 
(2 Words -Line Count) 

(l Word-Angle) 

Enabled, Disabled 
Internal, External 
Line-to-Line Update 
Line Search — Fwd, Rev 
Data Formats 
Byte Pack {2 samples/ 
word) 

Word Pack 
Single Sample-1. D., 

No I.D. 

Averaged Sample 
Time Data 
At(SS-RS) 

At(SS -8S J + 1 ) 

Speed; Fwd, Rev; Filter 
Count/ Zero Cross; Record; 
Stop 


\ Software Control of 
f DR-liC Interface 


} 


Input 


Control 

J 


Multichannel (for Analog 
Tapes) 

Address & Data -Decode 
to Channel 

Display 
Input (Digital) 

(X - u)/o 
Exponent 

Classified Output (Class # 
Selectable) 

Mixed Input and Class Output 
Linear Combinations of Inputs 
Syncs for Recording (RS, SS) 
Dead Time Control 


^ Control 


"h Bus Select 


! I/O Control 


1 


Control 


Di. f/.ay Hardware 


I/O Interface ior 
DR-11C and DR-11B 


) Indicators Printout 


} 


Indicators/Printout 


\ CRT-P7; Moving 
( Window, Color 
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- conversion of data from various formats to one more compatible with the MIDAS 
System 

- transfer of data from CCT (i.e., slow input data source) to HDT (high-density 
digital tape, i.e., a fast input data source) for multiple-pass input applications 

(b) improved signature extraction and analysis 

(c) pre -classification analysis 

1, determination of angle effects, both additive and multiplicative 

2 selection of a ratio transform 

3 selection and calculation of a linear transform 

(d) MIDAS hardware set-up programming — i.e., preparing the MIDAS control logic, 
loading up RAM coefficients for use in a classification run, preparing proposed pre- 
processor with appropriate functional parameters 

(e) control of classifier in actual classification operation, using various input and 
output media 

(f) display of data 

1^ grayscale and/or color display of either multispectral data or classification 
results with zooming and scrolling capability 

2 scatter plots of classification results 

3 histograms 

(g) post-classification analysis of data 
1_ area totalling (with histogram capability) for classification results 
2 ground-truth comparison 

(3) Define standard I/O formats acceptable to the system for the following peripheral de- 
vices: 

(a) industry-compatible digital magnetic tape (7- and 9-track) 

(b) high-density digital tape 

(c) disk-file formats 

(4) Generate documentation for all programming to be implemented prior to actual coding. 
This will permit evaluation of the MIDAS System as an integrated whole before any program- 
ming has been performed. 

(5) Determine the ramifications of a possible Phase III which would implement a multi- 
user time-shared system for use of MIDAS hardware. In order to avoid the necessity of recod- 
ing programs in a possible Phase III, it will be necessary to specify a multi-user operating sys- 
tem which will incorporate Phase-II program implementations after relatively minor modifica- 
tions. 
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(6) Evaluate software system performance in terms of human/ machine interaction. Since 
the primary emphasis of Phase II is to increase the efficiency of the user in producing a final 
product, this evaluation will take place at all stages of the software system development and 
include interaction with various existing multispectral processing user groups. It will also 
include determination of the execution and user times for all software modules as a basis for 
multi -processing or time-shared operations. 


5.4 STUDIES 

5.4.1 GEOMETRY CORRECTION 

Previous investigations have revealed various requirements for spatial resolution and lo- 
cation accuracy in different user applications. Over a wide range of such applications there 
appears to be a strong correlation between the resolution required and the location accuracy 
needed: on the average, a difference factor of four seems to be the upper limit. In other words, 
if a given application requires a resolution of 50 ft to see and classify a certain object, then the 
classification map should show the same object located no farther than about 200 ft from its true 
location. This is not to imply that the locations shown need be this accurate with respect to 
geodetic or earth coordinates, but rather that on a classification map, objects of interest should 
be positioned to at least this accuracy with respect to local landmarks. A study of the possible 
geometrical correction approaches that could be used to meet the above criteria should be under- 
taken. 

5.4.2 SYSTEM OPERATION 

As part of the operation of the system in Phase II, a number of studies and evaluations 
should be made. In using the system it should be possible to predict the improvements in per- 
formance and processing costs that will be realized by the interactive man/ machine capability. 
The time that is saved by this capability will contribute to lower processing costs. 

Although MIDAS is designed basically as a complete, stand-alone system with a mini-com- 
puter performing all the control functions, a study of how this system could interface to a large 
general-purpose computer system is desirable. Conceivably, for example, an operational sys- 
tem having an extremely large data base might well feature a large general-purpose computer 
and one or more MIDAS Systems as peripheral processors. 

The operational MIDAS System can accept data from analog tapes or CCT tapes and will be 
upgraded in Phase II to handle data from high-density digital tapes. This should enable the sys- 
tem to accept and process data from many scanner sources — for example, from a specified 
scanner sponsored by AAFE (Advanced Applications Flight Experiment). In processing these 
and other data, system performance can be stated in terms of data throughput. Then knowing 
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the limitations of the system developed under Phase II, we should be better able to specify the 
special-purpose digital processing concept which would process data at much higher data rates. 
Current data rate through the MIDAS Classifier is about 12 megabits/ sec. The investigation 
should examine data rates on the order of 100 megabits/ sec. 

In evaluating the next generation system, some thought should also be given to a more ad- 
vanced processing concept. The present system uses a single hardwired quadratic decision-rule. 
While it is possible to design a generalized quadratic decision-rule which includes a linear de- 
cision-rule, the hardware may prove prohibitive in cost. Thus a study of hardware and cost 
impacts and trade offs is desirable. 
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6 

HARDWARE DESCRIPTION 

6.1 MIDAS CONTROL SECTION 

The MIDAS System has three distinct modes of operation: 

(1) Classifier mode 

(2) System A/D conversion 

(3) System D/A conversion 

In Mode 1, the MIDAS System accepts multichannel data in either analog or digital form 
and synchronizes the conversion and flow of this data to the classifier. 

In Mode 2, the MIDAS System accepts multichannel analog data, converts it to digital 
form, and synchronizes the transfer of the resultant data to the PDP-11/45 computer. 

In Mode 3, digital data is transferred from the PDP-11/45 to the hybrid D/A subsystem 
where it is converted to analog form. 

The section of the MIDAS System that establishes the operating mode and controls the 
flow of data from source to destination is referred to as the control section. A block diagram 
of the control section is shown in Fig. 25. In the following paragraphs we describe the blocks 
that make up the control section design and detail their interrelations. 

6.1.1 SYSTEM CONDITIONER 

The conditioning of the MIDAS System to one of its operating modes is accomplished in 
the system conditioner of the control section. This is achieved by sending codes that define 
the mode parameters from the PDP-11/45 through a DR-11C interface to the system condi- 
tioner. The defining codes are then stored in the conditioner where they are decoded and 
transferred to the appropriate sections of the system. The parameters that define the system 
mode and their corresponding codes are detailed in the section below. 

MIDAS CONTROL COMMANDS (TO DR-11C) 

A. Command Format 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


command 

data bits 

bits 



Bits 15-12 (highest -order bit - 15) are used to specify one of the 16 com- 
mands. 

Bits 11-0 are used to pass command information to the processor control 
logic. 
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B. Commands (to DR-1 1C) 

The commands to the system conditioner follow, listed in order of 
binary command. 

(1) 0000 - Register Interrogate & Misc. 

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


0 0 0 0 

not 

register 

misc. 


used 

[ interrogate 



Register Interrogate - 

bit 7: 0 - register interrogate disable (disregard bits 6-4) 

1 - register interrogate enable 

bits 654: 

000 - least -significant byte (LSB) of line count register 

(10 bits) 

001 - most -significant byte (MSB) of line count register 

(10 bits) 

010 - scanner oeriod (12-bit binary) 

011 - scanner,/ roll sync interval (12-bit binary) 

Miscellaneous - 


bits 3210: 


0001 - 
0010 - 
0011 - 
0100 - 
0110 - 
0111 - 
1000 - 
1001 - 
1010 - 
1011 - 
1100 - 
1101 - 
1110 - 
1111 - 


no action taken 

reset 

start 

gate reset 
clock strobe 
transfer 
A/D strobe 
start reset 

reset CSR0 (on DR-UC) and ATTN bit (on DR-UB) 


spare 

n.aO 


n.a. 


n.a. y 


n.a. 


n.a. J 


no action taken 
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(2) 0001 - A/D Channel Select RAM Load 


15 

14 

13 

12 

11 10 9 

8 7 6 5 4 

3 2 10 

E 

0 

0 

1 

not 

used 

address 

channel 

data 


bits 7-4: RAM address for channel number to be stored 
bits 3-0: channel number to be stored in RAM addressed by 
bits 7-4 

(3) 001; x - Video Gate RAM Load 

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

0 0 10 resolution # to be loaded into 

video gate RAM 

bits 11-0: Load up Video Gate RAMs in order. Addressing is 
performed by the hardware starting at address 0 
after a gate reset command L' issued. Maximum 
of 16 video gate limits can be loaded (2 limits 
per gate). 

(4) 0011 - Calibration Gate RAM Load 

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

0 0 11 resolution # to be loaded into 

calibration gate RAM 

bits 11-0: Load up calibration gate RAMs in order. Address- 
ing is performed by the hardware starting at 
address 0 after a gate reset command is issued. 
Maximum of 16 calibration gate limits can be 
loaded (2 limits per gate). 

(5) 0100 - A/D Condition Load 

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 



bits 3-0: total number of channels to be selected from analog 
tape for digitization 

bits 7-4: total number of gates per scan line (video gates 
plus calibration gates) 
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bits 9-8: A/D mode select 
bit 98: 

00 - deselect A/D 

01 - A/D analog mode (i.e., data transfer to computer 

memory via DR-11B) 

10 - A/D classify mode (i.e., data transfer directly 

to classifier hardware via hybrid circuitry 
from analog tape) 

11 - A/D external mode (i.e., data transfer directly 

to classifier hardware via hybrid circuitry 
from an external source) 
bit 10: line start/ stop enable 

- value of 0 — disable line start/ stop hardware 

- value of 1 *■, ^ble line start/ stop hardware 

bit 11: data form t witch (relates only to A/D analog mode 
transfer via DR-11B to core) 

- value of 0 — one 8-bit data value per 16-bit bus 
transfer 

- value of 1 — two 8 -bit data values per 16 -bit bus 
transfer 


(6) 0101 - D/A Channel Select RAM Load 


15 

14 

13 

12 

11 10 9 

8 

7 0 5 4 

o. 

t-H 

CM 

CO 

E 

1 

0 

1 

not 

used 


address 

channel 

data 


bits 7-4: RAM address for channel number to be stored 
bits 3-0: channel number to be stored in RAM addressed by 
bits 7-4 


(7) 0110 - D/A Duty Cycle 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


0 110 


total number of resolution elements 
per scan line 


bits 11-0: total number of resolution elements to be generated 
per scan line (i.e., between generated sync pulses) 
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(8) 0111 - D/A Video Resolution Cycle 

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


0 111 

total number of video res'n elements 


per scan line 


bits 11-0: total number of video resolution elements to be 
generated per scan line 

(9) 1000 - D/A Condition Load 


15 14 

13 

12 

11 

10 9 8 

7 6 5 4 

3 2 10 

1 0 

0 

0 


D/A 

mode 

not used 

# of 

channels 


data format 


bit 11: data format switch (relates only to the D/A mode of 
transfer from core directly to analog tape via the 
DR-11B) 

- value of 0 — 1 8-bit data values per 16-bit bus transfer 

- value of 1 — 2 8-bit data values per 16-bit bus transfer 
bit 10-8: D/A mode select 

bits 1098: 

000 - not assigned 

001 - digital classification (from core) 

010 - digital classification (from external source) 

011 - digital transfer 

100 - diagnostic check of three -states (digital word 

from computer to computer) 

101 - diagnostic of A/D-D/A (from computer to D/A 

thru A/D back to computer) 

110 - digital -to -analog conversion 

111 - not assigned 

(10) 1001 - Clock Control 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


10 0 1 

D/A line 

A/D 

A/D 

A/D 


count 

resolution 

calib. 

video 


clock 

rate 

gates 

gates 


bits 11-9: 

D/A 

bits 8-6: 

A/D 

bits 5-3: 

A/D 

bits 2-0: 

A/D 


line count clock 
resolution rate 
calibration gates 
video gates 
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Clock Control Schedule (for 1001 Command) 
Gate Clock - bits 5-3 (calib.) of 2-0 (video): 


Sample Rate 543 

(f = resolution rate) 210 

4f 000 

2f 001 

f 010 

f/2 011 

f/4 100 

f/8 101 

f/ 16 110 


A/D Resolution Rate - bits 8-6: 


Tape Speed (ips) 

60 

30 

15 

7-1/2 

3-3/4 

1-7/8 

15/16 

(11) 1010 - Not assigned 

(12) 1011 - Port Select 


876 

000 

001 

010 

011 

100 

101 

110 


15 

14 

13 

12 

11 10 9 8 7 6 5 

4 

3 

2 1 0 

[I 

0 

1 

3 

data j 





data: to be assigned 




1100 

- Classifier Clock 




15 

14 

13 

12 

11 10 9 8 7 6 5 

4 

3 

2 1 0 

F 

1 

0 

0 

BURST N-l 



freq. 

divide 



Burst stop / reset 
Free Run/ Burst 
mode 
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bits 2-0: FREQUENCY DIVIDE 


Clock Rate 


3.2 MHz) 

210 

f 

000 

i/2 

001 

f/4 

010 

f/8 

011 

f/ 16 

100 

i/22 

101 

f/ 64 

110 

f/128 

111 


bit 3: 0 —BURST/STOP 
1 —BURST/RESET 
bit 4: 0 —FREE RUN MODE 
1 —BURST MODE 
bits 11-5: BURST N (STORE N-l) 

(14) 1101 - Line Count and Start/Stop Control 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


110 1 


function 


data 


bits 11-10: function to be performed depends on current processor 
mode (l.e., either A/D or D/A): 

(1) When processor is In D/A Mode 


bits 11 10 

0 0 - load LSB of line count (10-bits BCD) 
0 1 - load MSB of line count (10-bits BCD) 

(2) When processor is In A/D mode 


bits 11 10 

0 0 - load LSB of line start (10-bits BCD) 

0 1 - load MSB of line start (10 -bits BCD) 

1 0 - load LSB of line stop (10-blts BCD) 

1 1 - load MSB of line stop (10 -bits BCD) 

Modes (1) and (2) Bits 9-0: data to be loaded 

(15) 1110 - not assigned 

(16) 1111 - not assigned 
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6.1.2 PROGRAMMER 

That part of the control section which synchronizes the flow of data from one of the Input 
ports of the MIDAS System to an output data bus is called the programmer. It is made up of 
two separate components; one of these, the digital synchronizer, is used to control digital in- 
put data to the system: the second component, the A/D synchronizer, controls analog input 
data. In the following sections, we discuss the way the two components of the programmer 
operate in the different MIDAS System modes. 

Digital Input Synchronizer . This section of the programmer receives *Urt commands 
from a clock source that starts the tranrfer of data from ♦he PDP -11/45 and finally controls 
the latching of the data into the appropriate register on the hybrid cards. The source of the 
initializing signal is dependent on the operating mode of t * 'tern. If the system is perform- 
ing a classification of the input data, the start command ci ip tes in the classifier clock con- 
trol. When the system is in the D/A mode, the request co. ' \ is generated in the D/A 
clock generator. A block diagram of the digital synchronizer is given in Fig. 26. 

The start command generates a request for data from the computer in the request logic 
of Fig. 26. Upon receiving the request for data in the DR-11B interface, the computer initiates 
an output data transfer cycle. When the transfer of data to the output bus is complete, the 
DR-1 IB sends an end data signal to the digital synchronizer. The end data signal starts the 
unpacking of the 16 -bit word on the output bus into two 8 -bit data samples which are loaded 
into registers on the appropriate hybrid cards. After a data word has been unpacked and 
stored, the current channel address is compared with the final address Goaded from the sys- 
tem conditioner into a 4 -bit latch). If the final hybrid card has not been loaded with data, a 
request for data is again sent to the computer. This sequence of transferring data from the 
computer is continued until all the channels are loaded. When the cycle is complete, the digi- 
tal synchronizer is reset, and the data in the holding registers on the hybrid cards are ready 
for classification or D/A conversion. 

Digital Output Synchronizer. This section of the programmer controls the operation of 
the MIDAS System when the input data are multichannel analog. With this type of input data, 
the system has two separate modes. In one mode, the multichannel analog is digitized, stored, 
and finally presented to the classifier for processing. The components of the system that per- 
form these functions are ail located on the hybrid card; the timing signals that control them 
are generated in the classifier clock. A signal from the clock logic (A/D strobe) starts the 
A/D converters on the hybrid cards. Another signal from the classifier clock generator (A/D 
latch) latches the converted data into registers and from there the data are presented to the 
output three-state gates of the hybrid cards. These multichannel data are then multiplexed to 
the classifier by controlling the enable lines of the output three-state gates with signal* 
derived from the address lines of the classifier clock. 

115 










> t 1 ' 1 J ! I 



i 


The second mode of operating the system with analog input data is the conversion of the 
analog data to digital data, and the transfer of the data to the PDP-11/45 for analysis. A 
block diagram of the A/D synchronizer that performs the data transfer is presented in 
Fig. 27. For this mode of operation, the A/D converters are controlled by an A/D timing 
generator which determines the sampling of the analog data. When an A/D conversion has 
been completed, the end of the A/D busy command starts the A/D synchronizer to pack and 
transfer words to the computer. The words are packed by enabling the three -state gates of 
two data channels for each word transfer. The trailing edge of the A/D busy signal sets the 
start flip-flop in the synchronizer. When the start flip-flop is set and the cycle logic is in a 
reset state, the cycle logic generates three signals that pack and transfer one word to the 
PDP-11/45. After these signals have been generated, the cycle logic remains in a set condi- 
tion. The end cycle from the DR-1 IB, which tells the synchronizer that the word has been 
accepted, resets the cycle logic. If the start flip-flop is still in the set state, the cycle logic 
initiates another word transfer. This sequence continues until the start flip-flop is reset and 
the cycle is complete (i.e., all channels for the A/D sample pulse have been transferred). 

The first signal out of the cycle logic checks the comparator to see if the cycle is com- 
plete. If all the channels have been transferred, the signal out of the comparator resets the 
start flip-flop. If the cycle is not complete, a second signal is generated in the cycle logic 
wh.ch loads the address counter into the lower byte register and then updates the address 
counter. A third signal which loads the address counter into the upper byte register and re- 
quest. a data transfer to the DR-11B is then generated. The byte registers are decoded in 
the channel decoder which, in turn, enables appropriate three -state gates on the hybrid cards. 

6.1.3 DIGITAL DATA SELECTOR 

The digital data selector of the control section performs the following functions: 

(1) Accepts two 16 -bit input words — one from the computer unibus, and the other from 
the external digital input port. 

(2) Selects one of the two input words. 

(3) Sequentially unpacks the selected word into 8 -bit samples. 

A block diagram of the digital data selector is given in Fig. 28. 

The two 16-bit input words to the data selector are divided into 8-bit bytes by feeding 
the lower bit of the two words to one 8-bit word selector (A) and the higher order bits to 
another (B). The input data source can then be selected by appropriately controlling the 
select lines of the word selector. 
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FIGURE 28. BLOCK DIAGRAM OF THE DIGITAL DATA SELECTOR 
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The outputs of the word selectors are terminated in three state gates. The gate outputs 

of the corresponding bits from the lower and higher order byte: are tied together. Tha' is, 

0 8 19 

the 2 three-state gate is tied to the 2 three-state, the 2 to the 2 , etc. 

In this way the 8 -bit output lines of the data selector are formed. The unpacking of ih' 
16-bit input word is achieved by alternately enabling the three-states of the lower and highe. 
order bytes. The control of the three -state enable lines is generated in the programmer of 
the control section. 

6.1.4 CHANNEL DECODER 

The function of the channel decoder is to decode the channel address supplied by the pro • 
grammer and to generate a channel enable signal to the hybrid cards. 

When the system is operating with analog input data, the channel enable signal selects 
the appropriate three-state gate of the hybrid cards. This insures that the correct channel is 
presented to either the classifier or the PDP-11/45 under the control of the programmer. 

For the system operating with digital input data, the channel decoder not only enables the 
output three-state gates of the hybrid cards, but it also generates the strobe pulse for latch A 
of the hybrid cards. This strobe is timed by the programmer to insure that the data are latched 
into the correct hybrid card. 

6.2 HYBRID CARDS 

The MIDAS System can accommodate multichannel analog or digital data. The source for 
this data may be the 16 analog inputs of MIDAS, the 16 external inputs of the system, or the 16- 
bit data lines from the PDP-11/45 (through a DR-11B interface). The particular data source 
is selected through the system conditioner of the control section. Irrespective of source, all 
the data are presented to separate hybrid cards within the control section where it is con- 
verted (where required), stored, and eventually sent to the selected destination. 

In MIDAS there are 16 hybrid cards, one for each data channel; a block diagram of the 
cards is given in Fig. 29. 

If the selected source for input data is multichannel analog, the separate signals are 
brought from the 16 analog input jacks to their assigned hybrid cards. The analog signals 
are converted on the cards to digital form and stored for eventual presentation to the selected 
destination. This destination is the 8-bit input lines of the classifier when the system is oper- 
ating in the classifier mode. When the system is operating in the system A/D mode, the out- 
puts of the hybrid cards are transferred to the PDP -11/45 computer. 
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For analog data the hybrid card is operated in the following manner: The analog signal 
is fed to a sample-and-hold amplifier located on the hybrid card where the signal is held during 
the time it is being converted to digital form. The A/D conversion is performed by an 8 -bit 
converter on the card and the control of the converter is generated in the programmer of the 
control section. Upon completion of the A/D conversion the digital data is stored; then it is 
presented to the output three -state gates through a word selector. The word selector acts as 
a data switch; it selects word A when the system is operating with analog input data and 
selects word B when the input data are digital. 

If the selected input data are digital, the source for this data may be the 16-bit word from 
the PDP-11/45 or the 16 -bit external digital input port of the MIDAS System. The selection 
of data source is established and stored in the system conditioner and the multiplexing and 
unpacking of the digital data is accomplished in the digital -data selector of the control section. 

For digital input data, the hybrid cards operate in the following manner. After the data 
source is selected and the 16 -bit data word is unpacked into 8 -bit samples, the 8 -bit samples 
are stored sequentially in latch "A" on the assigned channel hybrid card. After the data for 
all the channels has been loaded into the corresponding A latches, it is transferred simulta- 
neously into latch "B." The data from latch B are fed to the word selector on the hybrid card; 
then, when the system becomes conditioned to accept digital input data, the word stored in 
latch B is fed to the output three -state gates through the word selector. As indicated on the 
block diagram of the hybrid card (Fig. 29), the contents of latch B are always presented to the 
D/A converter. In this way the conversion of the digital data stored on the card can always 
be observed at the output of the D/A converter whether the system is in a digital classify 
mode or a system D/A mode. 

The output three -state nand gates on the hybrid cards form both 8 -bit and 16 -bit output 
dat~ lines; these lines are formed by joining together the corresponding outputs of the three- 
state gates on each card. The data from a particular card or cards can then be presented to 
the appropriate destination by enabling the three -state gates of the desired card. The 8 -bit 
data line is used to present data to the classifier and the 16 -bit line is used to transfer data 
to the PDP-11/45. By using a 16-bit data line from the hybrid cards to the PDP-11/45, two 
8 -bit data samples can be loaded onto the input unibus for each transfer cycle in order to 
achieve the maximum throughput efficiency. 

Each hybrid card contains a diagnostic relay which is used to calibrate the cards and to 
diagnose faults utilizing the PDP-11/45. 
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6.3 CLASSIFIER SECTION OF THE SYSTEM 

The classifier section of the MIDAS System consists of four bays, each containing 13 
wire -wrap cards. These bays are nearly identical in that each has the circuitry for one 
quadratic pipe computation as described in Subsection 4.2.1. This computation requires 
twelve cards: (a) a mean card, (b) a variance card, (c) eight matrix multiplier cards, (d) a 
squaring card, and (e) a square accumulate card. These card types are described in follow- 
ing subsections. In addition, each bay contains one of the following one-of-a-kind wire wrap 
2 

cards: k card, recognition card, diagnostic/output card, and clock card; these wire-wrap 
cards are also described in succeeding text. 

The wire -wrap hardware employs Augat 8136-URG1TG universal circuit boards and 8170- 
RG1 card racks and back planes. The cards can hold up to fifty 16 -pin integrated circuits 
(ICs) or eighteen 24 -pin ICs. 

6.3.1 MEAN CARD 

The prime purpose of this card is to sequentially add -p, to the data vector using two’s 

complement arithmetic. An overflow test which is defined by A C *B ‘(L + A *B *C e = over- 

o O b o o 5 

flow, where A + B = C and the subscript S indicates sign bit. The -p is stored in an 8 -bit 
RAM. The X inputs are multiplexed from a bus located in the A/D cards. The output of the 
adder is loaded into latches 74174 as processor outputs and into 8T10s for diagnostic outputs. 

A block diagram of this card is shown in Fig. 30. 

There are two selection devices on the mean card, one to select a RAM in that bay for 
writing into that RAM and the other to select a diagnostic bus output to the diagnostic card. 

In the RAM selection, a RAM write select comes from clock card which does a bay select 
decode to enable the decoder on the mean card; this also sends 4 bits to the decoder on mean 
card. A read pulse is then applied which can now propagate through only one decoder input 
on one mean card to a RAM on one other card in that bay. In the diagnostic bus output selec- 
tion an identical procedure is followed except that the origin of the signals is on the diagnostic 
output card. 

6.3.2 VARIANCE CARD 

The prime purpose of this card is to multiply an 8-bit number by a 12 -bit number and put 
out an 8-bit result with overflow. There are no latches in the processing stream on this 
card — but the diagnostic bus has a 9 -bit 8T10 latch. The input X-p is represented by 

XXXXXXXX. (2’s complement number) 

The coefficient 1/a is represented by 

X.XXXXXXXXXXXX (2’s complement number) 
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FIGURE 30. BLOCK DIAGRAM OF THE MEAN CARD 
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Since 1 /a is never negative, the sign bit is always zero. The eight output bits of the 20-oit 
product are selected from 

X XXX.XXXX 

11 11 76 0 

98 43 

The overflow test simply consists of testing bits 14 through 19 to see whether they are ail at 
the same level. If so, there is no overflow and the overflow output is high. A block diagram 
of this card is shown in Fig. 31. 

6.3.3 MATRIX MULTIPLIER 

This card has two main functions: (1) multiply [(X - \i)/v) x by a coefficient, and (2) 
to accumulate several products of this multiplication. The l(X - li)/o] { input from the variance 
card is stored in a latch on this card. The accumulator is cleared at the start of each signa- 
ture computation. Each 16-bit product of the multiplication is truncated to the high-order 10 
bits and latched before being applied as one input to the accumulator. The accumulator con- 
sists of a 12 -bit adder with a feedback latch providing the second input. The two expansion 

g 

bits in this accumulator are adequate for eight adds since the largest product is 2 (the re- 

7 7 11 

suit of (-2 ) x (-2 ) after truncation). Added eight times, this number yields 2 , which is the 

largest negative 12-bit number. Siw:? the two operands were negative, the accumulated prod- 
ucts should be positive. Hence, for this special case the overflow has caused an error in sign 
only. However, this number is t< be squared iiTthe next step, thus correcting the only possi- 
ble error. There are processor data -stream output -latches (8T10s with 12 bits) and diagnos- 
tic bus latches (8T10s, also with 12 bits). These latches are enabled at the proper time by 
the MTXMPX signal from the square-accumulator card and loaded by C0, the fundamental 
clock frequency. A block diagram of this card is showt *n Fig. 32. 

6.3.4 SQUARE CARD 

The prime purpose of this card is to do a 9 x 9 square on the 12-bit input from the ma- 
trix accumulator. The first operation is to check for overflow in the matrix -accumulator 
input signal before reduction to nine bits. The four high -order bits (8 through 11) are tested 
to see whether they are all at the same level. If so, there is no overflow and the overflow 
output is high. Bits 8, 9, and 10 are then dropped to reduce the square input to nine bits. A 
block diagram of this card is shown in Fig. 33. 

6.3.5 SQUARE ACCUMULATOR 

This card's main function is to accumulate the squares from the square card. It has 
both input and output latches as well as feedback latches. It has a 12-bit input (the sign bit 
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FIGURE 31. BLOCK DIAGRAM OF VARIANCE CARDS 
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FIGURE 33. BLOCK DIAGRAM OF THE SQUARE CARD 
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and the low-order seven bits from the 9 * 9 square are not brought in). The adder is 14 
bits but bit 0 is not used, so that three expansion bits are provided. The twelve most signifi- 
cant bits are brought out via 8T10s on both the processor data path and diagnostic path. The 

diagnostic bus is multiplexed from the diagnostic/output card, whereas the processor bus is 
2 

multiplexed from the k card. The square -accumulator card also has a number of control 
functions: 

(1) Acts as a buffer for the clocks (bay buffer). 

(2) Multiplexes the matrix multipliers (enables 8T10s on the two output busses which 
are then loaded at the next C0 clock). 

(3) Controls the matrix multipliers' output bus to the 9 x 9 square card using a 3 -line 
to 8 -line decoder. 

(4) Clears the matrix multipliers' accumulators. 

(5) Clears its own accumulator. 

(6) Multiplexes its own output 8T10s to the processor and diagnostic busses. 

(7) Contains overflow shift register. Overflows from mean, variance, and 9x9 square 
are loaded into the shift register and then dropped into the overflow output register 
(8T10s) at proper time. 

A block diagram of this card is shown in Fig. 34. 

6.3.6 k 2 CARD 

The main function of this card is to perform 10-blt by 12-bit multiply. The 12-bit data 
comes from the square accumulators and the 10-bit data is the k normalizing constant. Its 
input is buffered and there are no input latches since the outputs of the square -accumulator 
cards are latched and are bussed into it. There is a 74174 output latch register. 

Another function is to control the square accumulators’ bus, which it feeds using a 2- 
line to 4 -line decoder. The signal that multiplexes the output bus also multiplexes the over- 
flow bus to the recognition card. A block diagram of this card is shown in Fig. 35. 

6.4 FINAL DECISION 

The main function of this card is to determine which calculated material’s exponent is 

2 

the minimum. It also performs the Chi test on the sum of the squares. At clock times 5 

2 

and 13 a new set of sum of squares from the k card becomes available. A 14 -bit add is per- 
formed with the fn i Dl input truncated to bits 3 throi ^h 14, while the sum of squares is a 14- 
bit number in bits 0 through 13, 
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The selection time of the overflow into the recognition logic is the same as the selection 

time of the square accumulation output; therefore the overflow shift register output must have 

2 

two stages of latches to make up for the delay in the k multiply and in the Cn I D! add on the 

2 

recognition card. Similarly, the output of the Chi test must be latched to account for the 
Cn ID! add delay. 

^ new sequence of testing begins at clock time 8 for the 8-channel set-up or at clock time 
14 for the 4-channei set-up. At these times the previous recognition result is transferred to 
the output latches. There are two internal holding latches, one for exponent and one for mate- 
rial number. After loading these two quantities into the output latches, the internal ones are 
cleared. There is an artificial seventeenth bit set at this time which makes the stored expo- 
nent look like it is larger than any 16-bit computed exponent. Thus the first test will always 
succeed and the exponent be stored if no overflow has occurred. When this condition is met the 
artificial bit is reset. The material code is a 5 -bit number. It is MSB 000XXXX where the 
X’s give the material number (0 through 15), or code 16 indicates no material selected. A 
block diagram of this card is shov\i in Fig. 36. 

6.5 CLASSIFIER TIMING 

The classifier consists of the four ’’pipeline” computers in parallel whose outputs finally 
converge on the circuits to scale the exponents of the density function and intercompare these 
exponents for a decision. The sequence of operations can be visualized as shown in Fig. 37 
where data is shown entering the A-D converters at t = -16 in the upper left corner of the dia- 
gram. A sample, consisting of a vector of eight elements of eight bits each is passed through 
the computational circuits indicated and emerges at the bottom right of the diagram as a 
classification code of 5 bits. The general appearance and function of the arithmetic operations 
so diagrammed is that of a ’’cascade” in which the breadth of the cascade in time is propor- 
tional to the computational load of a particular circuit. 

Each ’’pipe” or ’’cascade” processes the computation of the quadratic form for the expo- 
nent of the Gaussian distribution for two distributions. There are, then, a sequence of alter- 
nating computations of the first exponent in cascade 1, the second exponent in cascade 1, the 
third exponent in cascade 2, etc. In the present machine, there are four such cascades oper- 
ating in parallel, allowing the computation of eight exponents at once. In phase II the machine 
will be expanded to eight cascades to allow computation of 16 distribution exponents at a time. 

The timing diagram, Fig. 37, shows the flow of two exponent computations and the result- 
ing decision, neglecting the fact, for the sake of simplicity, that the cascade would normally 
contain portions of other computations for the preceding and subsequent samples. Data is 
entered into the A-D converters and is available for computation at the end of 16 machine 
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FIGURE 36. BLOCK DIAGRAM OF RECOGNITION CARD 
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cycles, or approximately 5 microseconds. Data latched ir. the converter outputs is then sup- 
i plied sequentially (X^ f X^, . . . X fi ) via a multiplexer and subtracter to a latch, and one clock 

cycle per element is allowed for this operation. At t = 0, is subtracted from X^; at t = 1, 

^2 is subtracted from Xg, etc. Also, at t = 1, (X^ - is multiplied by (1/a yielding X*. 

At t - 2, X^ is supplied to each of eight multiplier -summers which compute the products 
(Xj*Pu), **81^ anc * en * er these into the summers. At t - 3, the multiplier- 

summers compute (Xg'P^), (X*-P 22 ) * • • ^ X 2' P 82^ and add these P roducts to the Previous re- 
suits. Thus at t = 10 the summers contain the complete sums of products for all matrix opera- 

» 

. j tions. Each of the eight multiplier-summers may, as a result, be considered as a row operator 

since it accomplishes the sequence of multiplications and summations for a particular row. 

This vector has the property as the result of the above transform that all its elements are 
uncorrelated, and therefore need only have its elements squared and summed to obtain the nor- 
malized quadratic form for the exponent. This is accomplished during cycles t = 10 to t = 18, 
allowing one cycle for each squaring operation and a final cycle for storage of the summation. 

At this point, the exponent must be re -scaled and the natural logarithm of the determinant 
of the covariance matrix added to obtain the final exponent of the density function. This re- 
quires two cycles. The comparison of these exponents, now becoming available from the 
normalization circuitry, begins as each exponent appears. The procedure is to examine all 
exponents sequentially to choose the smallest, assuming that one is less than a threshold test 
value which is entered first, and to retain at all times the lesser value of two sequentially 
examined exponents. The number of the exponent retained specifies the class of the input 
vector. This is available to be displayed, printed on film, or supplied to the computer for 
logging or subsequent processing. 

The cascades may also be used to process an increased number of distributions for a 
lesser number of channels. Thus for a 4 -channel source, such as ERTS, the operations of 
the various arithmetic units may be time-shared to provide 16 class decisions instead of 8 
class decisions for 8 channels. 

6.6 CLOCK 

This card has two main purposes — generating of timing pulses to control the 16 steps in 
the processing of a data vector, and controlling the loading of the RAMs during system setup. 

A block diagram is shown in Fig. 38. 

The timing pulses, or clock pulses, are derived from a master oscillator. A countdown 
is program-selectable in six steps providing clock pulse C0 at rates from 3.2 MHz to 100 kHz. 
Other clock signals are labeled Cl to C4 and Cfi to C4. 
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The signals Cl to C4 are used for classifier control, whereas C0 is used for loading 
latches at the end of a clock period. The circuitry can operate in a burst mode with or with- 
out reset to zero. The burst mode is used when operating the system using data coming from 
the computer. The clock can be set to count up from zero to 127. For a data vector to u? 
processed, sixteen steps are required. These are provided by setting the clock counter at 15, 
thereby allowing the sixteen steps 0 through 15. The clock is started by the computer when 
the computer has loaded the data vector into the second rank of the data register (latch "B" 
on the hybrid cards). The classifier processes this vector and stops until the computer has 
loaded a new data vector. If the computer loads a new vector first, however, the logic waits 
for the clock to finish before a new start signal is accepted by the clock. 

Control of RAM loading during system setup is accomplished by means of logic on the 
clock card. Timing pulses and routing control are provided (see Fig. 38) which transmit the 
contents to any selected RAM by replacing the clock pulses Cl through C4 with the low -order 
four bits of the address word. The addressing and loading of RAM constants is accomplished 
by the following sequence: 

(1) SEND WORD 1, DATA READY AND END CYCLE 

(2) UPON RECEIPT OF CYCLE REQ. SEND WORD 2 

(3) UPON RECEIPT OF CYCLE REQ. SEND NEW WORD 1 

(4) REPEAT 2 

The process stops when the word count register in the DR-1 IB increments to zero 

The two-word transfers of RAM coefficients are given below: 

WORD 1 CODE BAY RAM ADD 

11000 " 0 DDD DDDD DDDD 
MSB MSB MSB 

WORD 2 CODE RAM CONSTANT 

WT DDDDDDDDDDDD 
MSB 

NOTES: D is DATA - 0 or 1, as the case may be 

BAY is binary -coded 0 to 7 (only 0-3 are used) 

RAM is binary -coded: 
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0 

1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 


MEANS 
VARIANCE 
MTX 0 
MTX 1 
MTX 2 
MTX 3 
MTX 4 ) 

MTX 5 
MTX 6 
MTX 7 
k2 or CHI 
LN 


ALL 

BAYS 


f k2 in Bay 2 
\CHI in Bay 3 
(Bay 3 only) 


ADD is binary -coded 0-15 for internal address 
of a RAM. 


6.7 DIAGNOSTIC/OUTPUT SECTION 

The diagnostic /output (D/O) card is the prime output port for classifier -generated sig- 
nals. Inputs to the D/O may originate on the recognition card (5 -bit material code and/or 16- 
bit exponent value) or at any access to the classifier diagnostic bus. Outputs from the D/O 
card are presented as a 16 -bit word to the computer interface and as an 8 -bit word to a dis- 
play interface. 

The D/O card has two input registers for holding the values of material and exponent 
from the recognition card and one register for holding the diagnostic value (see Fig. 39). 
Contents of these three registers are combined and switched to the two output ports under 
control of the classifier clock and a computer -selected command word. 

The command word establishes the desired output formats and selects a time and place 
for sampling the diagnostic line. The command word is stored in a register and will command 
the same outputs to repeat twice each classifier cycle until a new command is sent. 

Available output formats are as follows. 

(1) At the computer interface: 

(a) the full 16 -bit exponent value 

or (b) eleven most significant exponent bits plus a 5 -bit material code 

or (c) 12-bit diagnostic plus 4 zeros 

or (d) a composite word, presented once each cycle, consisting of two 5 -bit material 
codes 
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FIGURE 39. BLOCK DIAGRAM OF DIAGNOSTIC/OUTPUT CARD 
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(2) At the display Interface: 

(e) the eight MSBs of the exponent 

or (f) the eight MSBs of the diagnostic 

or (g) 5 -bit material code 

Any combination of the above computer and display outputs may be selected by the appro- 
priate command except that (d) and (e) are not available simultaneously. The outputs are 
specified by the codes in Table 11. 

6.8 INTER-CARD-FILE WIRING 

The wiring between card files is accomplished in two main ways. The control file and 
the hybrid file use one technique, the classifier files another. A combination of the two is 
used in wiring the classifier files to the computer and other files. 

The first method uses four standard card slots in the hybrid and control files as connec- 
tors. Standard cards (Douglas or DEC boards) are not wired with ICs but rather with ribbon 
cable wired to the printed circuit connector lands. The ribbon cables from the computer's 
DR-1 IB were also wired to these cards. This scheme is outlined in the block diagram of 
Fig. 40. It allows each of these two bays to be unplugged from the system. On the control 
file one connector cables to the control panel, a second connector cables to the DR-1 1C inter- 
face, a third to the hybrid card file, and a fourth to the classifier for clock control. On the 
hybrid file two cards cable to a back panel containing BNC connectors for analog input and 
output, a third connector cables to the computer DR-11B and a fourth one to the control file. 

The classifier section of the system employs wire -wrap connectors, and some hand wire- 
wrapping was needed to get the data and control signals to each of the card files. The detailed 
wiring between the various files is shown in Fig. 41. In addition, the cabling between the en- 
tire classifier and the control file and second DR-11B in the computer was accomplished by 
using two DEC wiring blocks of the type used in the PDP-11/45 computer. The classifier was 
connected to these blocks with wire -wrap connections and standard DEC boards were plugged 
into the blocks. Classifier connections to the second DR-1 IB in the computer are detailed in 
Figs. 42 and 43. 

6.9 INTERIM DISPLAY AND RECORD FACILITY 

To provide a pictorial output showing system capability near the end of the first phase of 
the MIDAS development program, a subsystem consisting of several available instruments 
and parts was assembled. 

The display equipment consisted of a large-screen, long -persistence X-4 cathode ray 
oscilloscope, a dual -sweep oscilloscope to provide deflection voltages for the display, and 
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TABLE 11. CODE SELECTION FOR THE DIAGNOSTIC/ 
OUTPUT CARD 


CODE 


TO COMP 


TO DISPLAY 


10001 

000 MAT (4) 

000 

MAT 

01001 

000 MAT (4) 

000 

MAT 

10101 

11 -BIT EXP d) 

MAT 

01101 

11 -BIT EXP (2) 

MAT 

11101 

11 -BIT EXP @ 

MAT | 

10011 

DIAG 


0000 1 

01011 

DIAG 


0000 1 

11011 

DIAG 


_0000_1 

10111 

EXP 


J 

01111 

EXP 

11111 

EXP 


000 MAT | 
DIAG OQ 1 
000 MAT ' 
DIAG 0 
EXP (3) 
000 MAT 
DIAG CD 
EXP (3) 
000 MAT 1 
DIAG Cl) 
EXP (3) 


NOTES: 


(l) 6 most significant bits of 12 

(g) 11 most significant bits of 16 

(3) 8 most significant bits of 16 

® This output appears every second 
" process cycle; all other outputs 
appear every cycle. 
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(a) Sheet 1 

FIGURE 41. INTER-BAY WIRING (Continued) 
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various modular amplifiers to control display quality. The incoming signals for the display 
were the decoded and selected classification codes from the MIDAS classifier. 

Briefly, the display sweeps slowly down the large -screen CRO, drawing one horizontal 
line across the CRO for each line of the data source or record. During this sweep the CRO 
beam is modulated with the decoded classification code; this generates a picture in which 
those areas which are bright are those points (or pixels) classified as the selected code. 

The picture is held by the phosphor (P-7) for a time long enough to allow the image to be seen 
in a darkened room. 

At the same time he display is generated, the class code is decoded and the output lines 
(13) are adjusted in level (0, +1V) to allow recording on a multi -track recorder for subsequent 
printing on an available black and white film-strip printer. These signals and a sync signal 
are recorded on an instrumentation tape recorder employing the FM intermediate band mode 
so that the recording is compatible with the printing system. 

6.10 PHYSICAL DESCRIPTION 

The Phase-I MIDAS Classifier is housed in one 6 -ft rack assembly: a photograph of it is 
shown in Fig. 44. The power is single-phase, 120-volt AC with 20-ampere service. The sys- 
tem contains two power supplies — a 5 -volt, 80 -ampere supply and ±15 -volt, 3 -ampere supply. 
A matrix multiplier card has been measured as drawing about 1.4 amperes; therefore it 
would appear that all 52 wire -wrap cards draw about 70 amperes. A photograph of the four 
card files housing the wire-wrap cards is shown in Fig. 45. The physical location of the 
major components is diagrammed in Fig. 46. 
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(Rear View) 


FIGURE 46. PHYSICAL LOCATION OF MAJOR MIDAS COMPONENTS 
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Appendix A 

COMPARATIVE TEST OF MIDAS (PHASE I) PERFORMANCE 

Our test of MIDAS on an ERTS data-set was performed and compared with results ob- 
tained in -eparate processing of the same data-set using the IBM 7094. At the time of the 
comparison, the 7094 processing task conducted under the ERTS program was not yet com- 
plete, but its objectives and interim results nonetheless serve as background for the processed 
imagery examples submitted with the present report. The current status of the ERTS Yellow- 
stone project as recently reported by ERIM appears below in Section A.l. Following this, 
Section A. 2 describes the MIDAS Phase I procedures employed in processing a quarter- 
section of the same ERTS Yellowstone data to obtain an imagery sample for comparison 
purposes. 

A.l YELLOWSTONE PARK DATA 

A fundamental task of the National Park Service (N.P.S.) under Yellowstone’s new Master 
Plan for park management is that of assigning geographic areas within the park their optimum 
role in meeting important multiple -use needs in accordance with established criteria for pro- 
tection of the environment. Although specific management measures for accomplishing this 
goal are to be identified through ’’mission oriented” research, their implementation is certain 
to require an organized body of data on the type and distribution of natural resources through- 
out the park. 

An objective of this project is to obtain, from ERTS -A MSS data, detailed vegetation and 
terrain feature maps for Yellowstone suitable for these many purposes. Completion of these 
maps will mark the first time a park -wide environmental inventory of Yellowstone has been 
available as a basis for management planning. They will provide not only an up-to-date record 
of conditions within the park, but will, in addition, serve as quantitative baseline data for 
monitoring natural ecosystem development ?nd the ecological consequences of management 
decisions over extended periods of time. 

The project is a cooperative one between ERIM, Dr. Harry Smedes of U.S.G.S. and Mr. 

Don DeSpain of N.P.S. 

A. 1.1 PRELIMINARY 5 -CATEGORY COVER-TYPE RECOGNITION MAP 

Computer -compatible tapes (CCTs) of bulk -processed MSS data for the 6 August 1972 
(frame 1015-17404) overflight of the test site were received at ERIM in early December. 

The decision to process this data reflected both the fact that this orbit marked the first cloud- 
free observation of the park, and, from an ecological standpoint, occurred during a period of 
d stinctive seasonal differences in the appearance of various types of vegetation cover. 
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ERIM’s initial task consisted of converting the data on the ERTS tapes to a format com- 
patible with in-house software programs, and subsequently generating a digital graymap of 
MSS 5 (red band) for the entire park. A copy of this map was sent to Dr. Smedes in mid- 
December, 1972, for his use in transcribing cover-type training set locations onto a common 
base. A list of these training sets, along with their respective environmental parameters and 
graymap coordinates, was transmitted to ERIM by Dr. Smedes in January, 1973. ERIM then 
began an extensive program centered around the extraction and analysis of multispectral 
statistical signatures for the training sets, which comprised our effort for February. Based 
on the environmental complexity of cover types observed during this period, a decision was 
made to initially produce a basic, 5 -category map representing the endpoints of important 
spectral and environmental cover type conti nua . We reasoned that it was important initially 
to determine the level of recognition accuracy we could attain using spectra) categories which 
were easily distinguishable. 

The 5 -category map was produced in early March and reported in a paper, "Terrain 
Classification Maps of Yellowstone National Park," given at the GSFC during the March 5-9 
ERTS Symposium [6], An analysis of recognition accuracy for the five categories resulted in 
the following hierarchical ranking: 

(1) water 

(2) coniferous forest 

(3) light rock 

(4) grasslands 

(5) wet meadows 

Estimates of recognition success based on percent area correctly classified indicate an overall 
map accuracy greater than 82 percent. 

A. 1.2. FINAL COVER-TYPE RECOGNITION MAP 

In an attempt to resolve the causes underlying poorer recognition associated with the 
grassland and lowland meadow types of the 5 -category map, to refine these and the other 
categories, and to enlarge upon the total number of training sets, Mr. Ralph Root, a Colorado 
State graduate working for Dr« Smedes, spent a week at ERIM in April. During his visit, high- 
altitude RB-57 color and color infrared photography of the park was examined on a VARISCAN, 
The optical enlarging capability of the VARISCAN permitted us to closely match scales be- 
tween a scene viewed on the film and the graymaps. Not only was identifying an appropriate 
training set on the film much easier, but transcription of this location onto the graymaps was 
far more accurate. 
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As a result of this analysis, several of the original training sets, especially those in the 
grasslands and lowland meadows categories, were found to be less homogeneous than previously 
believed. This appears to be the major explanation for the poorer recognition of these types 
in the preliminary map. 


On this premise, we proceeded with another program of signature extraction and analysis 
for refined versions of original training sets and some representing several new categories. 
This work occupied our efforts for May. The results of our findings at the end of this period, 
based solely on training set signature separation, indicated that we now had 18 potential cate- 
gories for inclusion in the final map. These categories were: 


1. Lodgepole pine (young -dense type) 

2. Lodgepole pine (mature -more open type) 

3. Douglas fir 

4. Spruce fir 

5. 30-45% conifer cover over rock rubble 

6. 30-45% conifer cover over grass 

7. 15-30% conifer cover over rock rubble 

8. 15-30% conifer cover over grass 

9. Wetland shrubs 


10. Upland grass 

11. Lowland grass 

12. Grass -brush mixture 

13. Brush 

14. Dark rock 

15. Light rock 

16. Thermal deposits 

17. Alpine meadows 

18. Water 


Recognition maps of these 18 cover types were generated for restricted test areas within 
the park and sent to Dr. Smedes for his evaluation of their accuracy. 

Based on Dr. Smedes’ and Mr. Root’s findings, several steps were taken preparatory to 
more extensive mapping. First, the category alpine meadow was dropped because of excessive 
recognition commission errors as light rock. Similarly, a disappointing number of recognition 
commission errors between categories of morphologically similar vegetation types prompted 
us to combine several similar categories. Included in this situation were the merging of the 
two Lodgepole pine types, Douglas fir and Spruce fir categories back into a single coniferous 
forest type, and merging of the upland and lowland grass types into a single grasslands 
category. 

In the case of the forest species, it is possible that many of the recognition commission 
errors resulted from varying signal contributions of path radiance in MSS 4 and 5. Compari- 
son of signal means in these channels for the various forest species (Table A.l) reveals they 
are rarely more than 1 to 2 data values apart. By our estimations, path radiance at critical 
altitudes within the park was at least this great. This was determined by comparison of 
MSS 4 signal levels for deep-pure water from two lakes at elevations of 6000-7000 ft. A sig- 
nal increase of 1.11 data values attributable to path radiance was observed for the water sig- 
nature at the lower elevation. 
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Recognition processing of the entire park was completed in June, using the following 13 
categories: 


1. Coniferous forest 

2. 30 -45 r 7 conifer cover over rock rubble 

3. 30-45 f y conifer cover over grass 

4. 15 -30 f 7 conifer cover over rock rubble 

5. 15 -30^, conifer cover over grass 

6. Wetland shrubs 

7. Grasslands 


8. Grass -brush mixture 

9. Brush 

10. Dark rock 

11. Light rock 

12. Thermal deposits 

13. Water 


The classification output was then mapped and sent to Dr. Smedes and Mr. DeSpain. 


Dr. Smedes. Mr. Root, and Mr. DeSpain are currently concentrating on evaluating the 
recognition accuracy of this latest product. Additionally, Mr. DeSpain is beginning an assess 
ment of its impact on National Park Service management -decision processes. 


A. 1,3 CONCLUSIONS 

This project is expected to contribute much new ecological knowledge about the Yellow- 
stone landscape. In particular, the revegetation patterns of old wildfire burns are being 
analyzed to determine successional trends occurring in the park. This information should 
have a significant impact on the Park Service’s fire protection and wildlife management 
programs. 

On a more general level, we are working closely with Mr, DeSpain to determine if there 
are more desirable, or additional formats in which to present data abstracted from the final 
map in terms of making it a more effective part of or basis for the ’’mission oriented” re- 
search involving formulation of Yellowstone’s future management policies. 

One format we are currently experimenting with is the production of thematic maps. 

Small examples of these have been made illustrating important ecological cover type continua , 
e.g., forest to bare rock, and forest to grassland. This type of information on vegetation type 
boundaries and transitions has considerable value in the management of ’’edge” species, i.e., 
plants or animals that depend entirely, or in part, on the ecological character of a transition 
zone between major cover types for their existence. 

We are also interested in investigating a method of automatically computing from the 
map data the acreage of the various cover types and amount of edge within specified corridors 
where a land use shift has been proposed. This capability would provide the Park Service 
with a means of obtaining a timely, specific, and detailed data base for assessing the environ- 
mental impact of proposed management alternatives. 
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A. 1.4 RECOMMENDATIONS 

Many of the most important cover types have signature means in MSS 4 and 5 separated 
by only 2 percent of the sensor's dynamic range, i.e., 2-3 data values in 127 (Tables A.l and 
A. 2). Excepting thermal deposits, which appear essentially white, the brightest natural ob- 
jects in the scene rarely exceed the data value of 50. This represents only 40 percent of the 
dynamic range and indicates that over one-half the scanner’s sensitivity is not being used. 

Since the danger of clipping important data appears minimal, we believe a pass over the 
park exercising the high gain option is highly desirable. We anticipate that this data will be 
extremely useful in assessing the potential separability of important forest species. Using 
this option the data will be expanded 3 -fold as compared to MSS 4 and 5. m effect, the data 
takes on more significance because now there will be a greater number of quanta within the 
relative separation of the signatures. 

It would also appear that preprocessing the data to remove the effects of path radiance 
as experienced in MSS 4 and 5 is essential to consistent recognition of forest types that occur 
in interspersion over a wide range of base altitudes. 

A. 2 MIDAS PHASE I PROCESSING PROCEDURES 

The following software procedures were implemented in carrying out the Phase I demon- 
stration of MIDAS software capability on the ERTS Yellowstone National Park data-set re- 
corded on 6 August 1972. 

(1) Program DISPLAY was employed to select from this data set one quarter-frame 
which would be most representative for demonstration purposes. Quarter-frame 3 
was determined to be the most suitable area in view of distinct surface features 
which would be clearly identifiable on the limited display facilities of the Phase I 
MIDAS. 

(2) Since the Phase I signature-extraction facility was not operational as of 1 November 
1973, suitable signatures for use with the Yellowstone data-set were obtained on the 
IBM 7094 data processing facility, employing the identical algorithms specified in 
this report for the PDP-11/45 software system. These signatures were subsequently 
used to obtain a multiple -category classification of the ERTS Yellowstone data-set 
comparable to that described in the preceding section. In all, 15 signatures were 
formulated by combining signatures of homogeneous areas over several regions of 
the entire data-set. Program SIGCVT performed the transformations necessary 

and created a disk file of the proper format which provided an input file for subse- 
quent processing. 
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TABLE A. 1. TRAINING-SET SIGNATURES OF CONIFEROUS FOREST COVER TYPES 
FOR ERTS-A YELLOWSTONE PARK MSS DATA, 6 AUGUST 1972 


Training Set 


Signature Mean and Standard Deviation 



MSS 4* 

MSS 5* 

MSS 6 r 

MSS 7 f 

Lodgepole Pine (young) 

17.23 ± 1.01 

12.79 ± 0.98 

22.07 ± 1.51 

11.76 ± 0.94 

Lodgepole Pine (mature) 

16.18 ± 0.88 

11.01 ± 0.80 

21.71 i 1.37 

11.46 ± 0.72 

Douglas Fir 

16.67 ± 1.00 

10.89 ± 1.05 

22.26 ± 1.80 

11.82 ± 1.61 

Spruce - Fir 

15.58 ± 0.92 

10.09 ± 1.12 

20.01 t 2.22 

10.64 ± 1.42 


TABLE A. 2. TRAINING-SET SIGNATURES OF MAJOR TERRAIN COVER TYPES FOR 
ERTS-A YELLOWSTONE PARK MSS DATA, 6 AUGUST 1972 

Training Set 


Signature Mean and Standard Deviation 



MSS 4* 

MSS 5* 

MSS 6* 

MSS 7 t 

Coniferous Forest 

17.93 ± 1.96 

13.45 ± 2.67 

23.54 ± 3.28 

12.61 ± 1.98 

30-45% Conifer cover 
over rock rubble 

19.15 i 2.03 

15.81 ± 2.74 

27.11 ± 3.95 

14.87 ± 2.30 

30-45%, Conifer cover 
over grass 

18.72 ± 1.33 

13.50 ± 1.46 

31.90 ± 6.26 

18.36 ± 4.02 

15-30%, Conifer cover 
over rock rubble 

25.68 ± 2.97 

25.07 ± 4.09 

37.07 ± 4.31 

19.52 ± 2.15 

15-30%, Conifer cover 
over grass 

21.24 ± 2.18 

16.93 ± 2.99 

40.74 ± 6.05 

23,74 ± 3.88 

Wetland Shrubs 

19.33 ± 1.05 

13.38 ± 1.31 

43.87 ± 3.81 

26.35 ± 2.98 

Grasslands 

22.64 ± 1.55 

18.52 ± 1.55 

47.62 ± 5.11 

29.17 ± 3.83 

Grass/Brush 

24.17 ± 1.29 

21.72 ± 2.30 

37.97 ± 3.52 

21.52 ± 2.59 

Brush 

26.32 ± 1.50 

25.28 ± 1.88 

31.30 ± 2.78 

15.64 ± 1.79 

Dark Rock 

27.72 ± 4.42 

26.41 ± 5.32 

27.24 ± 3.80 

12.51 ± 2.05 

Light Rock 

37.62 ± 8.29 

40.40 ± 11.14 

43.57 ± 10.83 

19.78 ± 5.54 

Thermal Deposits 

65.48 ± 9.69 

72.52 ± 11.23 

71.00 ± 9.31 

31.58 ± 3.60 

Water 

13.65 ± 0.79 

5.85 ± 0.87 

2.99 ± 0.70 

0.27 ± 0.45 


^Current potential range of 127 data values. 
^Current potential range of 64 data values. 
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(3) Operating on the data file created by SIGCVT, program SCALER performed the 
mathematical manipulations necessary to obtain constants in a form suitable for 
loading into the MIDAS classifier coefficient RAMs. SCALER uses the mean vector 
and covariance matrix to generate this set of constants — which play a key role in 
implementing the maximum -likelihood rule. (See Section 3.2.5 on the algorithms 
used.) 

(4) Once these constants were generated via SCALER, program RAMLD allowed the user 
to select a subset of signatures for actual loading into classifier RAMs. This subset 
selected, it was necessary to normalize certain constants within the subset, viz., 

the normalization constant for the decomposed inverse correlation matrix, and the 
additive constant specifying the natural logarithm of the determinant of the covari- 
ance matrix. After these constants had been calculated, all of the coefficients derived 
from the signatures were loaded in several large DMA transfers via the DR-1 IB 
general Interface. 

Next, the user specified an upper -limit for the chi -squared value arrived for each 
resolution element. This chi-squared test, comprising the last overflow test per- 
formed by the hardware, acts as upper bound on the maximum -likelihood criterion. 

(5) With the signature coefficients loaded into the classifier, program CLAS1 set up the 
parameters for the classification procedure to be perform <; next in the logical 
operational sequence. The user now specified, via console keyboard, the region to 
be classified; designated the I/O media to be used; and specified the output format 
from the classifier. This information was stored in a temporary data -file on disk, 
and a program CLAS2D was loaded into core and executed. To control the classifica- 
tion procedure, program CLAS2D used the temporary disk file created by CLAS1. 
After supervising the initiation of the classification process and controlling the sub- 
sequent flow of data both in and out, CLAS2D returned control to CLAS1 for input of 
new classification parameters by the user. 

(6) Upon completing the classification process, we again called upon program DISPLAY 
to generate grayscale maps exhibiting the classification results. Here, because the 
line printer is only intermittently available and slower than the CRT, we employed 
the latter as the output device. 
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Appendix B 

RAM LOADING REQUIREMENTS 

Loading of coefficients into the RAMs is somewhat Involved because of the machine’s 
pipeline nature. Also, it appears that two programs may have to be written, one for 4 -channel 
and one for 8 -channel operation. At this time, however, it appears that only the 4 -channel 
case need be considered since that is what we use for the Phase I demonstration. Basically, 
there are three sets of coefficients to be loaded: (1) a -#i vector, (2) a 1/a vector, and (3) a 
U -Square matrix oi eigenvectors. 

To load the RAMs requires a 12-bit address located as follows in the computer word: 


internal 

bay card RAM address 


— 



— 

f — 

— ^ 


f — 


11 

10 9 


Ll 

6 5 

I±] 

[ E 

2 1 0 


The bay and card bits are relatively straightforward except that the Internal RAM address is 
more complex. Up to 18 materials, designated by the letters A,B,C,D,E,F,H,J,K.L,M,N,P,R, 
S,and T, can be stored In the processor. (These letters correspond to standard wiring let- 
ters — including DEC’s.) To load the means (-p vectors) for letters A,3,C, and D, the 
address codes are 


= 00001000010000 first element 

-li r = 00101000010000 

00001000010001 

L 0010 10000! 0001 

00001000010010 

00101000010010 

0000; 0000 100 11 fourU element 

00101000010011 

= 00011000010000 

0011 10000 10000 

00011000010001 

D 0011 10000 10001 

00011000010010 

00111000010010 

0001 0000! 0011 

00111000010011 


For the next set of four -4 vectors, we have, first 
-JT e 00001000010100 to 0111 


Then similarly, for the other three materials 
-/Ip - 0001 000010100 to 0111 
-iI H 00101000010100 to 0111 
-jT, -- 00101000010100 to 0111 
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In the next set of four vectors, we use high -address numbers in the internal RAM address 
-^ K = 0000 10000 1 1000 to 1011 
down to 


-M n = 0011)00001 1000 to 1011 

The last set of materials is 

-|T p = 0000 1 0000 11100 to 1111 
down to 


-iT T = 0011100001 1100 to nil 


The next vector to be loaded is the 1/a vector. These are loaded in almost the same 
way as the mean vector with this exception: We add 1 to the least significant bit in the inter- 
nal RAM address to compensate for the pipeline nature of the processor. Thus, 


1/a = 0000 1 0001 1 0001 first element 

A 0000 1 0001 | 0010 
000010001)0011 

0000 1 000 1)0100 f ou rth element 

lTa = 0001 ) 0001)0001 
B 0001 ) 0001)0010 
0001 ) 0001)0011 
0001 ) 0001)0100 


and so on until we come to the las’: set of four materials P through T. Here 


l/a_ = 0000 1 0001 1 1101 
V 0000 1 0001 1 1110 
0000100011 1111 
0000 1 0001 1 0000 


l/a_ = 0011 1 0001 1 1101 first element 
00111000111110 
0011 1 0001 1 1111 

0011 1 0001 1 0000 fourth element 

Next to be loaded are the U -square matrix coefficients. For these, there are eight 
• cards, each designed to hold a row of coefficients for an 8 x 8 matrix. 

To store four 4x4 matrices on these cards, only half the amount of coefficient storage 
is needed. The classifier is organized so that the coefficients of four 4x4 matrices can be 
stored in the two 8x8 matrices as follows: 
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0 

1 

Matrix Mult. ^ 
Card No. J 

Material 

0 

E 

(zeroes) 

0 

P 

in A 

Bay 0 5 

6 

A 

0 

K 

0 

7 






2 3 4 5 

6 7 8 9 

10 11 12 13 

14 15 0 1 


V ^ ' 

Internal Address Number 


Bay 0 is selected by the 0000 code in bits (11,8). Matrix multiplier 0 is selected by the 
0010 code in bits (7,4) and each succeeding matrix multiplier is selected in turn by advancing 
the code by 1 up to number 7, code 1001. Note the permutation of the internal RAM address 
by 2. The other bays employ the following code for loading material coefficients: 

bay 1 
code 0001 

bay 2 
code 0010 

bay 3 
code 0011 


0 

F 

0 

R 

B 

0 

L 

0 

0 

H 

T 

S 

C 

0 

M 

0 

0 

J 

JL 

T 

D 

0 

N 

0 


In the configuration using eight 8x8 matrices, the material coefficients would be stored as 


bay 0 

A E 

bay 1 

b| f| 

bay 2 | 

rci hi 

bay 3 | 

dI Jl 


Two other sets of coefficients are to be stored: (1) a set of normalizing coefficients k , 

o 

and (2) the log of the determinants of the covariance matrices. The k set is addressed by 


0010 

bay 


1010 

card 


xxxx 

material number 


The materials are assigned internal RAM addresses as follows: 

A 1011 
B 1100 
C 1101 
D 1110 
E 1111 
F 0000 
H 0001 
J 0010 
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V 


1 

I 


H 



L 0100 
M0101 
N 0110 
P 0111 
R 1000 
S 1001 
T 1010 

The log of the determinants is addressed by 

0011 1011 XXXX 

bay card material number 

Here, the material numbers are different from those above in that a 1 has to be added, thus 

A 1100 
down to 
F 0001 
and 
T 1011 

2 

One other constant for the Chi test has to be loaded at address 

0011 1010 ( ma t er j a i num ber not needed) 
bay card 



j 


J 
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Appendix C 

SPECIFICATION OF OUTPUT FILE FROM SIGNATURE PROGRAM 
C.l SIG AND SIGCVT OUTPUT 

The following provides a description of the output file generated by programs SIG and 
SIGCVT. 

(1) The file must be a contiguous DOS -file. 

(2) Structure: 

BLOCK 1 

Word 1 : 

Word 2 : 

Word 3 

BLOCK 2 etc. 

Word 1-3 : 

Successive 
Words : 

Successive 
Words : 


NOTE: Start each new signature on a new block. For example, if we 
have a 12 -channel signature, then we have the following num- 
bers of words used: 

3 Signature ID 

48 Mean vector (12 elements x 4 words/ element) 

312 Cov. matrix (78 elements * 4 words/element) 

363 Total number of words (two blocks/signature) 

Thus, the first signature begins on block 2, the second on block 4, 
the third on block etc. 

C.2 NOTES ON DATA PRESENTATION TO *WIREWRAP 
(1) An overview: 

Passing data to *WIREWRAP involves three things — what, where, and connections, 
(a) What — What kind of IC or module does the user now want to put on the board? 

The user specifies this by calling in a macro description of the IC he has in 
mind. He does this by simply giving the name of the macro. *WIREWRAP then 
looks in the user’s macro list until it finds the right one, and then is ready to 
accept more information. 
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No. of signatures in the file 
No. of channels (NCHAN) 

No. of words used per signature 


6 -character ID (ASCII) 

Mean vector of NCHAN elements (REAL*8, i.e., 
4 words/element) 

Covariance matrix (lower half), A, of 
(NCHAN+1)* N/2 elements (REAL*8) such that 
the first element passed is Ajj, the next A 22 , 
then A 22 , A 31 , A^. etc. 


i 

i 


i 


i 


i 


1 
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In short, macros are used to pass information statistics to *WIREWRAP 
and to implement the conversion from a net-nams into a pin number. 

Here, as examples, are two forms of macro description for a hex inverter 
module: 

I: MACRO 

INVERTER MODULE=SN74£l4 POWER=44 COST=25 TAG=14 
ASSIGN NETS=&INPUT PINS=(l,3,5,9,ll,13) LOAD=l 
ASSIGN NETS=&OUTPUT PINS=(2,4,6,8,lGf,12) DRIVE=10 
ASSIGN NETS=&POWER PINS=(7,14) 

MEND 

II: MACRO 
INV TAG=14 

ASGN NETS=&P PINS^l^S^.S.ej.e.Md.lMZ.lS.M) 

MEND 

The words "MACRO" and ’’MEND” must accompany each macro description. 

1. In the first case above, the name of the module is "INVERTER"; in the 
second it is "INV." 

2. The manufacturer's model number of the first is "SN7404," and the 
second is unspecified. 

3. The first module consumes 44mW: for the second this is unspecified 
(default fi). 

4. The first module costs 25 cents; the second is unspecified (default tf). 

5. Both are known to have 14 pins (present routines require the "TAG=" 
to be present). 

6. For the first module it is known that the unit load of the inputs is 1, 
that its output fanout is lp, and that its power pins (ground, +5 volts) 
are 7 and 14 respectively; in the second case this information is 
unspecified. 

The order of appearance of COST, POWER, TAG, MODULE is not significant. 
Both macros I and II are valid, but I is more logically apparent to the user, and 
he is less likely to make mistakes with it. ("ASGN" is "ASSIGN" in short form.) 

In I, the names "INPUT," "OUTPUT," "POWFF" are known as logical net 
names and are distinguished by the which precedes them. They can be 8 or 
less characters long. 

(b) Where — Where does the user want this IC or module placed on the board? 
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*WIREWRAP automatically starts placing modules on the board in locations 
0,1, 2,3, 4, . . . unless otherwise instructed. 

The ”$POSITION nn” command instructs *WIREWRAP to place the next 
module at location nn, and unless told not to, the next one after that at position 
(nn+1), then (nn+2), etc. 

If the user wished to place hex inverters, for example, in locations 0, 1,3,5, 
48, 49*, the following would accomplish that: 

INV 

INV 

$ POSITION 3 
INV 

SPOSITION 5 
INV 

$ POSITION 48 

INV 

INV 

If necessary, these positions can be thought of as pin locators, not IC seats. For 
example, if a 3 -pin crystal clock is being used, it could be inserted into several 
different empty module positions to allow for its widely spaced pins. The output 
would then think that there had been several strange ICs inserted, instead of one 
unique device. 

In short, unless $POSITION’ed, *WIREWRAP will get its new position by 
adding 1 to its old position. 

(c) Connections — How does the user connect the IC pins? 

Since each logical net -name is associated with a group of pin numbers, all 
that has to be done is to associate the names of the pins with the logical net- 
names, A pin’s name is called its net -name . 

Any number of pins which receive the same net -name from the data become 
wired together. The net -name might be thought of as a signal name — any pins 
labeled with that name must have that name’s signal value. Of course there can 
be only one name per pin. 

For example, take the ’’INVERTER” again. Since ’’INPUT” is associated 
with the input pins in the order 1,3,5,9,11,13, we can assign any or all of those 
pins a name by using ’’INPUT.” Suppose now that we want the name ”XX” to be 


*NOTE: For the user to know, for example, exactly where location 48 is, he must have a 
board layout diagram. In the Generalized Augat URG-1, layout locations 0-49 are for 14- or 
16-pin ICs only, 50-67 are for 24-pin ICs only, 68-75 are for I/O, 76-81 are for Ground, and 
82-87 for +5 volts (VCC). 
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on pin 1 and "ZERO" on pin 3. Then, INPUT=XX,ZERO would accomplish this. 
What about pins 5,9,11,13? Since they are not called or specified, they are not 
wired. 

Suppose further that we only wanted to wire pins 9 and 13 on another SN7404 
inverter. Then INPUT* ,„XX„ZERO would be the data. Notice that in the first 
case, trailing commas were not necessary. *WIREWRAP detects the end of the 
list by seeing a blank at the end. In the second case we need three commas as 
null-element spacers, then the name "XX” followed by its comma, followed by a 
null-element comma for pin 11, then the name "ZERO" for pin 13. 

There can be no blanks internal to an assignment statement. Therefore the 
following would be illegal : INPUT* AA, BB, CC, , However, there can be many 
blanks between different assignment statements. For example, suppose we 
wanted pin 1 to be AA, pin 2 to be BA, pin 7 to be G114, and pin 14 to be V54, then 

POWER=G114,V54 OUTPUT*BA INPUT=AA 

The order is insignificant as far as assignment statements go, but the order of 
the net -names in the assignment statements is one-to-one with the pin numbers 
in the marcofor that given logical net name. Logical net -names may be dupli- 
cated in different macros, e.g., we used M &PWR” in ail of them, but net -names 
can be presented in the data only on pins which are to be wired together. 

Notice that in the marco description the logical net -name has an ampersand 
(&) preceding it, but in the data the ampersand disappears. What happens if 
some data runs over one card? It is then necessary to use a dash ( — ) in 
column 80 to indicate continuation to the next card. (Recall that there can be 
no blanks inside of an assignment statement.) Thus if the data spills over, 
column 1 on the next card is considered column 80 (the dash disappears), and 
column 2 is column 81, etc. 

Example: 

. . .’TNPUT=ZERO, M . . .becomes: 

Column 


73 

74 

75 

76 

77 

78 

79 

80 

li| 

2 

3 

4 

i 

N 

P 

u 

T 

= 

Z 

- 

[e 

R 

0_ 


Card 2 | 


And there is nothing wrong with this occurrence: 
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. . . M INPUT=ZERO M becomes: 
Column 


71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

I 

N 

_P 

U 

T 


Z 

E 

R 

O 


if ’’ZERO" is the last net name in that marco(IC) to be assigned. 

(2) A specific example: 

Two inverters might be actually wired as in Fig. C.l. Call the first inverter BA, 
and the second one BB. We want BA in position 17 and BB at 18. 


L 


o 

2 

< 

W 

S 


67 54 53 52 


’T 


1 3 5 7 11 13 

7404 BA 17 

2 4 6 8 10 12 


Connector 


44 


0 


1 3 5 9 11 13 
7404 BB 18 

2 4 6 8 10 12 


FIGURE C.l. SAMPLE INVERTER WIRING 


$ POSITION 17 

INVERTER INPUT=MEANOF,GBA,P54,P53,P52,P52 OUTPUT=BA2,BA4, BA6, BA 8, 
BAld,BA12 POWER=GBA,VBA 

INVERTER OUTPUT=BB2„BB6,,BB1|0 POWER=GBB,VBB INPUT=BA10„GBB,,BA12 

Notice that the marconame comes first, then one of the assignment statements. Also 
it is not necessary to mention all of the logical net names if they are not all used. If 
we wished somehow to hand -wire the power pins, we could just not mention "POWERS 

and pins 7 and 14 would then be left free. 
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* 


Also notice in the example above that many names are singular, such as ”VBB.” 
If these were never mentioned before or again in the data they would not be wired, 
and would appear in the output in a ’’Singles List.” Frequently this catches mis- 
punched cards. 


A good way to generate meaningful net -names is to give each chip a letter code 
for its type, then a letter for its number (”BB ft is the 2nd inverter, ”MF” is the 6th 
multiplier, ”LA” is the first latch, etc), then name pins by their source output pins. 
For example, the 10th pin on BA is BA10 because it is an output pin, then pin 1 on 
BB is BA10 because it is connected to the 10th pin of BA. 

Also, ”GBA” and ”VBA” are the ground and voltage pins of BA respectively, and 
pin 54 on the connector becomes ”P54.” Pin 67 on the connector is an important 
control signal, so it gets a mnemonic name ’’MEANOF.” 


(3) Running *WIREWRAP on MTS at U of M Computing Center 

(a) For your 5SIGNON card, the TIME=12 and PAGES=25 CARDS=XXXX* 

(b) Next you need a $RUN *WIREWRAP+Subs+W064:SQGRD 

where "Subs" Is the name of your compiled object subroutine package, and 
W064:SQGRD is the system assembly program that punches the cards. 

(c) Next comes the $SET command where you can choose your options (see the MTS 
manual writeup). Some of the more useful ones are (default underlined): 


PUNCH=ON/OFF 
ABORT =ON/OFF 
DEBU G=ON/ OFF 

LEADLEN-n/0 

WRAPCOST=n/0 

WIRECOST=n/d 

XREF=ON/OFF 


controls production of Gardner -Denver deck 

if ON, prevents punching a deck in case of error 

if ON, prevents costly extras, like ordering of the 
pins, etc. (inhibits punching) 

n=ll (1/10 inch per wrap of wire) 

n=5 (£ per wrap) 

n=l (<? per foot of wire) 

cross-reference is not so useful after data is de- 
bugged. 


$SET uses no commas: ex: $SET BAYS-7 BAYCOST-26650 PUNCH-ON 


(d) STITLE, followed by a card containing the title you want to use 

(e) $DATA indicating the start of data 

(f) Then the Macxo description. (*W1REWRAP won’t know what ’’INVERTER” is 

until it has e .countered it in a macro list.)^ 

*If you intend to produce cards you must specify CAF£*>XXXX, where XXXX is just an 
upper limit on the number of cards you expect, and is typically less than 1000. 

^Frequently it is convenient to have your bunch of macros in a file (say, ’’MACROS”) and 
have ”$CONTINUE WITH MACROS RETURN"’ just after your $DATA card. Curiously, the 
Macro list is searched last first, so the most used macros should go at the end. 
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(g) Wiring data, including SPOSITION cards 

(h) SEND signaling the end of data 

(i) SENDFILE 

( 3 ) If an asterisk (* ) occurs in column 1, the card becomes a comment card. 

Suppose that your subroutine package is in a file named "SUBS," already compiled, 
and that your macro list is in a file named "MACROS.” 

Also suppose that you plan to hand-wire your power connections and your I/O, 
and that you want an inverter at position 36, with one line on pins 2 to 3 called 
"ONLY," and the title of the device is "SMALL TEST." The following would be a 
complete deck which would punch one Gardner -Denver machine card: 

SSIGNON CCNO T-12 P-25 CARDS-1000 'TEST DECK’ 

Password 

$RUN *WIREWRAP+SUBS+W064:SQGRD 

SSET BAYS-1 BAYCO6T-8800 LEADLEN-11 WIRECOST-1 

$SET WRAPCOST-5 ABORT-ON PUNCH-ON 

STITLE 

SMALL TEST 

Sdata 

SCONTINUE WITH MACROS RETURN 
$ POSITION 36 

* HERE IS YOUR ONLY DATA LINE: 

INVERTER INPUT-, ONLY OUTPUT -ONLY 

SEND 

SENDFILE 

(4) Output Interpretation 

(a) Source Listing — the input deck is listed (including the Macro file if one is 
SCONTINUE’ed). When a macro is called, that device is assigned an element 
number (El, E2, E3, . . .) by *WIREWRAP, then it is assigned a location (either 
by adding one to its last position or by being SPOSITION'ed). The error coun* 
given at the end applies only to *WIREWRAP syntactical errors, not mispunches 
or wiring errors. 

(b) Net-Name Cross Reference — each net -name is listed alphabetically along >vith 
the input deck line number in which it is used (once per use). 

(c) Net-Name Dictionary — each net -name is listed alphabetically followed bv 

\ an "0" if any two output pins with that net -name are collector OR’ed (hard- 
wire OR) 

2 a "* " if any output pin with that net -name is overloaded 

3 the sum of the normalized loads of the input pins with that net -name 

4 the normalized drive (fanout) of the l owest -drive output pin with that 
net -name 
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5 the list of pins with that net -name. Output pins are underlined. The pin 
names used are the output of WW4 and are also found on the interpreted 
Gardner -Denver machine deck. In general, the first 5 characters in the 
name give its layout position and the last 3 characters give thr physical 
board location. 

Since *WIREWRAP wraps pins serially (daisy-chain) U determines in 
which order to wrap a certain group of pins (those logically related by 
possessing the same net-name) so that the minimum amount of wire is used. 
If the DEBUG option is OFF, the Net -Name Dictionary listing of pins will be 
in the same order as will actually be wrapped on the board. 

The load and drive information is obtained from the marco description 
A pin is considered to be an output pin if in the macro it has the ’’DRIVE ” 
parameter. The default for LOAD and DRIVE is zero. In the Net-Name 
Dictionary a drive of 0 is assumed to be like an output or power pin so it 
won’t call a non-zero load an overload. 

(d) Module Dictionary — Totals the number and cost of all modules (macros) used. 

(e) Itemized Cost Analysis — Totals the cost of the device. 

(f) Documentation— This is the output of WW6, labeled "***SUMMARY OF PIN 
CONNECTIONS,” which documents which net -names were actually assigned 
where. 

(g) Statistics — Lists things like total power consumption, total feet of wire, etc. 

(h) Singles — Lists the net -names which were only assigned to one pin. Presence 
of a Singles List indicates a wire data error — usually a mispunching but some- 
times a conflict in naming-conventions. The list is deleted if no singles are 
discovered. 
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